聯系方式

您當前位置:首頁 >> Web作業Web作業

日期:2019-11-18 09:33

Project QQ Zone web scraping (a social media website)


Introduction:

The code to scrap data from this website makes use of requests, beautifulsoup, selenium, time and cookie. The difficulty is the use of selenium and cookie. Selenium has been explained in out tutorials. An HTTP cookie is a small piece of data sent from a website and stored on the user's computer by the user's browser while the user is browsing. It lets server programs track what pages a user has visited or what actions the user has performed. Some websites require log-in to read all data, that’s when we need a cookie in our code.


Requirement:

1.Website link: http://user.qzone.qq.com/QQ account  

2.Write a code to extract the following data:

a.Input QQ account

b.ShuoShuo(which is similar to twitter posts)

c.Posting time

3.Store the data in an excel



Codes will be graded based on:

1.The format of the codes

2.The organization of the csv file

3.The speed of the codes

4.The extendibility of the codes


Estimated completion time: 20 hours. Time consumption depends on the codes’ quality.

Suggestive points:

Deadline:


Penalty:

1.Late delivery/missing deadline will incur -10% of finally determined points of the whole project per missing day.


相關文章

【上一篇】:到頭了
【下一篇】:沒有了

版權所有:編程輔導網 2018 All Rights Reserved 聯系方式:QQ:99515681 電子信箱:[email protected]
免責聲明:本站部分內容從網絡整理而來,只供參考!如有版權問題可聯系本站刪除。

黑龙江体彩22选5