- Text To Compact-Documents Converter (Txt2Cpd) Download
Transforms various raw text formats, such as Text-Base and some standard datasets (eg., Reuters) into the file in Compact-Documents format (“.Cpd”).
Parameters and example call.
- Text To Bag-Of-Words Converter (Txt2Bow) Download
Transforms various raw text formats, such as Text-Base, Transactions-File, Compact-Documents-File, some standard datasets (eg., Reuters) into the file in Bag-Of-Words format “.Bow”.
Parameters and example call.
- Html To Xml Converter (Html2Xml) Download
Transforms Html documents into cleaned XML documents.
Parameters and example call.
- Html To Text Converter (Html2Txt) Download
Transforms Html documents into cleaned text documents.
Parameters and example call.