Everyday Data Jadoo Big Data Blog

DATA SCIENCE FOR SOCIAL GOOD FELLOWSHIP 2016

Data Science for social good

Once again, Data Science for Social Good will be hosting a fellowship during the summer of 2016. They have a summer program that trains data scientists by having them work on important and impactful projects. The fellowship is paid and it lasts 12 weeks in Chicago, Illinois. The deadline to apply is February 1, 2016, so you need to hurry if you are planning to apply. The projects vary every summer, but a list of potential and past projects include:

Read more

Craig’s List Downloader

The Goal of this project is to download all Craigs list listings by user and dealer.
The first step is to collect data.

The user decides what section of the website to be scrapped and how many pages or listings to be downloaded.
Once you enter this information the scrapper will download all the information and store them in a Database for you.

This can be used for further analysing.
We used the programming language python for this project.
Craig’s list features owners and dealers to sell items in their site. Our final database will also have a section whether this itemis listed by owner or dealer.

The libraries we use are urllib, Sqlite3, requests, Beautifulsoup ,re and operator.
The program is two stage
Stage 1: Individual page link down-loader.

Program

 

Stage 2 : Individual page info down-loader.

link_downloader

Once the we run these two programs, Python creates a database with all the necessary information.

3 tables are created

1.Total page link table.

 

Pages_to_be_downloaded

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

2.Individual links for each page in a table.

Indivduval_

3.Final information of the listing.

page_info

We finally generated 2500 listings per run.

Hadoop All Grown Up

hadoop on highway

Grab the latest news and commentary about Hadoop in this week’s Hadoop Happenings. This week commentary focused on Hadoop in action. Walmart discussed its plans for Hadoop, and an article on CIO.com discussed Hadoop’s impact on the insurance industry. See the full stories below..

Read more

Page 9 of 12« First...7891011...Last »

BLOG POSTS

ADDRESS

650 Parliament Street, Toronto,Ontraio, Canada
Phone: (416) 939-0044
Fax: (647) 720-2214
Website: http://www.datajadoo.com
Email: info@datajadoo.com

DISCLAIMER

Important:: This site has been setup purely for showcasing the analytic's skills of Data Jadoo. All the content are designed by Data Jadoo. Author retains his or her views on the topics expressed here. All images are copyrighted to their respective creators.