[nzlug] Extracting text from a .cwk doc
Johann Schoonees
j.schoonees at irl.cri.nz
Thu Oct 11 16:06:16 NZDT 2007
No :-( Too many \0 (^@) control characters embedded in the text, and it
does not translate the quote-like characters (" etc.) and possibly
others like them.
Johann
Steve Holdoway wrote:
> strings <filename> any use???
>
> Steve
>
> On Thu, 11 Oct 2007 12:30:34 +1300
> Johann Schoonees <j.schoonees at irl.cri.nz> wrote:
>
>> Hi List
>>
>> Apologies, this may be slightly OT. I have a file with extension .cwk
>> apparently emailed from a Mac judging by its AppleDouble encoding. I am
>> only interested in its text content.
>>
>> The person who sent the file to me seems not to use computers much
>> because I have not been able to get them to re-send it in any other
>> format (or even respond to emails). I don't have access to a Mac and I
>> don't have their permission to share the document with others.
>>
>> Crudely cutting out the text body in emacs and running that through
>> mac2unix does produce something vaguely readable, but it is mangled by
>> control sequences like ^@^@^C^L appearing in the middle of words, and
>> quotation marks replaced by non-ASCII characters.
>>
>> Can anyone recommend a conversion filter? Not much luck with Google.
>> It shouldn't need much more than a small tr or sed script if one knew
>> the translations.
>>
>> Thanks,
>> Johann
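The "small tr or sed script" asked for above could be sketched roughly as follows. This is only a minimal sketch under stated assumptions, not a real .cwk parser: it assumes the text body has already been cut out of the file (as described with emacs), that the text is encoded as Mac Roman, and that the embedded control bytes can simply be dropped. The function name cwk_bytes_to_text and the punctuation table are hypothetical names chosen for illustration.

```python
import re

# Assumption: any control character other than tab and newline is
# formatting debris (e.g. the ^@^@^C^L runs) and can be discarded.
_CONTROL = re.compile(r"[\x00-\x08\x0b-\x1f\x7f]")

# Typographic characters that Mac Roman text commonly contains,
# folded to plain ASCII equivalents.
_ASCII_PUNCT = str.maketrans({
    "\u201c": '"',   # left double quotation mark
    "\u201d": '"',   # right double quotation mark
    "\u2018": "'",   # left single quotation mark
    "\u2019": "'",   # right single quotation mark
    "\u2013": "-",   # en dash
    "\u2014": "--",  # em dash
    "\u2026": "...", # horizontal ellipsis
})


def cwk_bytes_to_text(raw: bytes) -> str:
    """Best-effort cleanup of a text run cut out of a Mac document."""
    # Decode as Mac Roman so bytes such as 0xD2/0xD3 become real
    # quotation marks instead of arbitrary non-ASCII characters.
    text = raw.decode("mac_roman", errors="replace")
    # Classic Mac OS ends lines with a bare CR; normalise to LF first
    # so the line breaks survive the control-character stripping.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Drop NULs and the other embedded control characters.
    text = _CONTROL.sub("", text)
    # Fold curly quotes, dashes and ellipses to ASCII.
    return text.translate(_ASCII_PUNCT)
```

To try it, read the cut-out body in binary mode, for example open(path, "rb").read(), and pass the bytes to cwk_bytes_to_text. If the control runs turn out to be followed by printable bytes that also belong to the formatting data, a few stray characters will remain and need removing by hand.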