{"id":73,"date":"2010-12-07T08:17:36","date_gmt":"2010-12-07T08:17:36","guid":{"rendered":"http:\/\/ailab.ijs.si\/ailab\/?page_id=73"},"modified":"2014-06-12T12:43:49","modified_gmt":"2014-06-12T12:43:49","slug":"pre-processing-of-documents","status":"publish","type":"page","link":"https:\/\/ailab.ijs.si\/si\/tools\/text-garden\/pre-processing-of-documents\/","title":{"rendered":"Pre-processing of Documents"},"content":{"rendered":"<p><\/p>\n<ul>\n<li><strong>Text To Compact-Documents      Converter (Txt2Cpd) <a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Txt2Cpd.zip\">Download<\/a><\/strong><br \/>\nTransforms various raw text formats, such\u00a0as Text-Base and some       standard datasets (eg., Reuters) into the file in Compact-Documents  format      (\".Cpd\").<br \/>\n<a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Txt2Cpd.htm\">Parameters and      example call<\/a>.<\/li>\n<\/ul>\n<ul>\n<li><strong>Text To Bag-Of-Words Converter      (Txt2Bow) <a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Txt2Bow.zip\">Download<\/a><\/strong><br \/>\nTransforms various raw text formats, such\u00a0as Text-Base,       Transactions-File, Compact-Documents-File, some standard datasets (eg.,       Reuters) into the file in Bag-Of-Words format \".Bow\".<br \/>\n<a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Txt2Bow.htm\">Parameters and      example call<\/a>.<\/li>\n<\/ul>\n<ul>\n<li><strong>Html To Xml Converter      (Html2Xml) <a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Html2Xml.zip\">Download<\/a><\/strong><br \/>\nTransforms Html documents into cleaned XML documents.<br \/>\n<a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Html2Xml.htm\">Parameters and      example call<\/a>.<\/li>\n<\/ul>\n<ul>\n<li><strong>Html To Text Converter      (Html2Txt) <a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Html2Txt.zip\">Download<\/a><\/strong><br \/>\nTransforms Html documents into cleaned text documents.<br \/>\n<a href=\"http:\/\/kt.ijs.si\/Dunja\/textgarden\/Html2Txt.htm\">Parameters and      example call<\/a>.<\/li>\n<\/ul>\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Text To Compact-Documents Converter (Txt2Cpd) Download Transforms various raw text formats, such\u00a0as Text-Base and some standard datasets (eg., Reuters) into the file in Compact-Documents format (\".Cpd\"). Parameters and example call. Text To Bag-Of-Words Converter (Txt2Bow) Download Transforms various raw text formats, such\u00a0as Text-Base, Transactions-File, Compact-Documents-File, some standard datasets (eg., Reuters) into the file in Bag-Of-Words [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"parent":26,"menu_order":6,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":"","_links_to":"","_links_to_target":""},"class_list":["post-73","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/pages\/73","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/comments?post=73"}],"version-history":[{"count":0,"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/pages\/73\/revisions"}],"up":[{"embeddable":true,"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/pages\/26"}],"wp:attachment":[{"href":"https:\/\/ailab.ijs.si\/si\/wp-json\/wp\/v2\/media?parent=73"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}