I have extracted a lot of forum posts into separate txt files, which are now on my hard drive. Each file contains information I would like to extract, some of which I have already figured out how to extract. The information I still need is in the following form, within the same "html block":

1: (x) messages in this thread
2: Message is in reply to (some html code) A HREF="http://stackoverflow.com/questions/12332335/link" (some html code)

In task 1 I simply need to extract x.
In task 2 I need to extract the links to which the message is in reply.

I have looked into the different tm and XML packages but have not been able to find out what to use. Any advice is appreciated.

Best,
Kasper
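
To make the two tasks concrete, here is a rough sketch of the kind of extraction described above, using only base R regular expressions rather than tm or XML. It assumes that x is a plain number (optionally followed by a closing parenthesis) and that every link after the "Message is in reply to" marker belongs to the reply block. The sample string and the function names extract_count and extract_reply_links are made up for illustration.

```r
# Hypothetical sample of one post's text; real input would be read from each txt file.
post <- paste0(
  "<b>3 messages in this thread</b> ",
  "Message is in reply to <i><A HREF=\"http://stackoverflow.com/questions/12332335/link\">parent</A></i>"
)

# Task 1: capture the number in front of "messages in this thread".
# Assumption: x is a run of digits, possibly followed by ")".
extract_count <- function(text) {
  pattern <- "([0-9]+)\\)?[[:space:]]+messages in this thread"
  m <- regmatches(text, regexec(pattern, text, ignore.case = TRUE))[[1]]
  if (length(m) < 2) NA_integer_ else as.integer(m[2])
}

# Task 2: collect the HREF targets that follow "Message is in reply to".
# Assumption: all links after the marker are reply links.
extract_reply_links <- function(text) {
  pos <- regexpr("Message is in reply to", text, fixed = TRUE)
  if (pos[1] == -1) return(character(0))
  rest <- substring(text, pos[1])
  hits <- regmatches(rest, gregexpr("HREF=\"[^\"]+\"", rest, ignore.case = TRUE))[[1]]
  # Strip the HREF="..." wrapper and keep only the URL.
  sub("^HREF=\"([^\"]+)\"$", "\\1", hits, ignore.case = TRUE)
}

count <- extract_count(post)
links <- extract_reply_links(post)
```

For the real files, each one could be collapsed into a single string first, for example with paste(readLines(f, warn = FALSE), collapse = " ") for every path f returned by list.files(). If the HTML is messier than this sample, a proper parser such as the XML package is likely more robust than regular expressions.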