[nzlug] Extracting text from a .cwk doc

Johann Schoonees j.schoonees at irl.cri.nz
Thu Oct 11 16:06:16 NZDT 2007


No :-(  Too many \0 (^@) control characters embedded in the text, and it 
does not translate the quote-like characters (" etc.) and possibly 
others like them.

Johann

Steve Holdoway wrote:
> strings <filename> any use???
> 
> Steve
> 
> On Thu, 11 Oct 2007 12:30:34 +1300
> Johann Schoonees <j.schoonees at irl.cri.nz> wrote:
> 
>> Hi List
>>
>> Apologies, this may be slightly OT.  I have a file with extension .cwk 
>> apparently emailed from a Mac judging by its AppleDouble encoding.  I am 
>> only interested in its text content.
>>
>> The person who sent the file to me seems not to use computers much 
>> because I have not been able to get them to re-send it in any other 
>> format (or even respond to emails).  I don't have access to a Mac and I 
>> don't have their permission to share the document with others.
>>
>> Crudely cutting out the text body in emacs and running that through 
>> mac2unix does produce something vaguely readable, but it is mangled by 
>> control sequences like ^@^@^C^L appearing in the middle of words, and 
>> quotation marks replaced by non-ASCII characters.
>>
>> Can anyone recommend a conversion filter?  Not much luck with Google. 
>> It shouldn't need much more than a small tr or sed script if one knew 
>> the translations.
>>
>> Thanks,
>> Johann


This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean.




More information about the NZLUG mailing list