Get Web Page

Retrieves web-document from the address specified in input URL parameter ('-i'). Web-document is saved in http file ('-ohttp' parameter), http-body file ('-o' parameter), xml file ('-oxml' parameter), text file ('-otxt' parameter) or to the screen ('-oscr' parameter).


usage: BowHKMeans.exe
-i:Input-Url (default:'')
-ohttp:Output-Http-File (default:'WebPage.Http')
-o:Output-Http-Body-File (default:'WebPage.Body')
-oxml:Output-Xml-File (default:'WebPage.Xml')
-xotxt:Xml-Output-Text (default:'T')
-xourl:Xml-Output-Urls (default:'T')
-xotok:Xml-Output-Tokens (default:'T')
-xotag:Xml-Output-Tags (default:'T')
-xoarg:Xml-Output-Arguments (default:'T')
-otxt:Output-Text-File (default:'WebPage.Txt')
-tourl:Text-Output-Urls (default:'F')
-totag:Text-Output-Tags (default:'F')
-oscr:Output-To-Screen (default:'F')

Example:
 

GetWebPage.exe -i:http://www.ijs.si/

The above example call retrieves (fatches) the web-document at http://www.ijs.si/.