Archive for November, 2010
Crawling the Web With Lynx
Introduction There are a few reasons you’d want to use a text based browser to crawl the web. For example, it makes it easier to do natural language processing on web pages. I was doing this a year or two ago, and at the time I was unable to find a Python library that would […]
Posted on November 9, 2010 at 10:38 am by Joe · Permalink
· 6 Comments
In: Python · Tagged with: nlp, web crawling
In: Python · Tagged with: nlp, web crawling