Posts

Scraping webpages is an important part of many workflows involving NLP. It's challenging though, because webpages are complex, and there's a…

I have been working with Apache Spark on Amazon's EMR recently, and it was a bit time consuming to manually upload my assembly (fat) jars…