smilax:: WebLog #1 Topic : 2005-09-01 19.23.27 strick : Markov3 (by Joe Buck)
I tried it on a modern Linux system today (in Sept 2005) and it still builds with make. (But you young ones will have to figure out what command to use to extract a "shar" file!)
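For the young ones: a shar ("shell archive") file is just a shell script that recreates its files when you run it with sh. Here is a minimal sketch of the idea. It builds a tiny stand-in archive so it can run on its own; demo.shar, hello.txt, and the extracted directory are made-up names for illustration, not files from the Markov3 distribution.

```shell
# Build a tiny stand-in archive so this example is self-contained.
# (demo.shar and hello.txt are hypothetical names, not part of Markov3.)
cat > demo.shar <<'EOF'
#!/bin/sh
# This is a shell archive: running it recreates the files it contains.
cat > hello.txt <<'END_OF_FILE'
hello from a shar file
END_OF_FILE
EOF

# A shar file runs as ordinary shell code, so read it before running it,
# and unpack it in a scratch directory rather than somewhere important.
mkdir -p extracted
(cd extracted && sh ../demo.shar)

# Show what came out.
cat extracted/hello.txt
```

Many systems also ship an unshar command (part of GNU sharutils) that does the same job with a few more safety checks, but plain sh is usually enough.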
For best results, feed it plain text files with headers, HTML stuff, URL stuff, and even section headers (titles) removed. It's designed to skip over RFC822 (mail or netnews) headers on its own.
Oh, also you must remove any CR characters at the end of each line of the input files (that is, use the UNIX newline convention). The UNIX command tr -d '\r' < input.txt > output.txt should clean that up for you. (The older spelling tr -d '^M' also works, where ^M is a literal control-M that you probably type as control-V followed by control-M.)
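Here is a small sketch of that cleanup that you can run as-is. It writes a two-line sample file with DOS-style CR LF line endings and then strips the CRs; input.txt and output.txt are placeholder names, so substitute your own files.

```shell
# Make a sample input file with DOS-style (CR LF) line endings.
# (input.txt and output.txt are placeholder names.)
printf 'first line\r\nsecond line\r\n' > input.txt

# Delete every carriage return, leaving UNIX-style (LF only) line endings.
tr -d '\r' < input.txt > output.txt

# Dump the bytes to check: there should be no \r left in the output.
od -c output.txt
```

Note that tr deletes every CR in the file, not only the ones at the ends of lines. For ordinary text that makes no difference.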
(last modified 2005-09-01)