Monthly Archives: March 2011

Content Extraction at FiveFilters.org

Full-Text RSS 2.7 from FiveFilters.org is now available. I thought I’d write about one area of improvement in this release: content extraction. Automatic Extraction Up to now we’ve relied mainly on PHP Readability to automatically identify and extract articles from web pages, and this is still how the majority of articles are extracted. It works […]

Posted in General | 10 Comments

Workers of the World Relax

Source: workersoftheworldrelax.org (via Medialens)

Posted in General | Leave a comment