Hello,
In my application I want to read data from web pages.
The web pages include JPEG images, text, and other data, but I need to extract only the text so that I can pass the extracted data through DataStage.
Could you share your experience with this?
Thanks
HH
Data from Webpages To Datastage
Where are your webpages being read from? Normally the pages will contain only HTML text and reference other non-text data. There is a Click Pack to help you work with the log files, but it doesn't help with the actual pages.
You can clean binary data out of a text stream with the 'MCP' conversion, which replaces each unprintable character with a period, but in DataStage you would still need to parse the data into usable information.
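As a rough illustration of the parsing step, here is a minimal sketch in Python that runs outside DataStage, on the assumption that you can pre-process the pages into a plain text file before the job reads them. The names `TextExtractor`, `html_to_text` and `mask_unprintable` are made up for this example. It keeps only the visible text of a page, skips script and style blocks, and then replaces unprintable characters with periods, which imitates what the 'MCP' conversion does.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects visible text, skipping <script>/<style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Tags such as <img> carry no text, so images drop out naturally.
        if self._skip_depth == 0:
            text = data.strip()
            if text:
                self.chunks.append(text)


def html_to_text(html):
    """Return the visible text of an HTML string, one chunk per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


def mask_unprintable(text):
    """Replace unprintable characters (except newlines) with periods."""
    return "".join(ch if ch.isprintable() or ch == "\n" else "." for ch in text)


if __name__ == "__main__":
    page = (
        "<html><head><style>p{}</style></head><body>"
        "<h1>Title</h1><img src='a.jpg'>"
        "<p>Hello &amp; bye</p><script>var x=1;</script>"
        "</body></html>"
    )
    print(mask_unprintable(html_to_text(page)))
```

The output could be written to a sequential file for the job to read. How the pages are fetched (for example with `urllib.request.urlopen`) depends on where they live, so that part is left out here.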