Web Crawler



A web crawler for technology articles.

I crawled the URL, article title, time, link, content, and attitude of each article from the websites listed below and stored them in a PostgreSQL database. I then searched the processed article content for the provided target words and collected the matching articles into a weekly news report. I also deployed this work to Amazon EC2 so that it could send emails automatically.
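The keyword search and weekly report step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `Article` record, the `find_matches` and `weekly_report` helpers, and the report layout are all hypothetical names and choices, and the articles are passed in as plain Python objects rather than read from PostgreSQL.

```python
import re
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Article:
    # Hypothetical record shape; the real database schema is not shown in this post.
    url: str
    title: str
    time: datetime
    content: str


def find_matches(article, target_words):
    """Return the target words that occur as whole words in the title or content."""
    text = f"{article.title} {article.content}"
    return [
        word
        for word in target_words
        # Lookarounds keep "AI" from matching inside words such as "raises".
        if re.search(rf"(?<!\w){re.escape(word)}(?!\w)", text, flags=re.IGNORECASE)
    ]


def weekly_report(articles, target_words, now):
    """Build a plain-text report of matching articles from the last 7 days."""
    cutoff = now - timedelta(days=7)
    lines = [f"Weekly news report ({now:%Y-%m-%d})"]
    # Newest articles first.
    for article in sorted(articles, key=lambda a: a.time, reverse=True):
        if article.time < cutoff:
            continue
        matched = find_matches(article, target_words)
        if matched:
            lines.append(f"- {article.title} [{', '.join(matched)}] {article.url}")
    return "\n".join(lines)
```

In a setup like the one described, the returned text could then be used as the body of the weekly email; the sending step is left out here because it depends on the mail configuration on the server.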

This work greatly improved how efficiently the VC firm where I interned could find potential investment opportunities.

First demo (word_count): a Flask app for manually entering the search words of interest. Github Link.

Final version of this project: Github Link.

Crawling Websites:

Report Sample
