Simple Scraping Task
I need a script that scrape URLs, and store the data in the database or excel file. 1. Scrape data from this link http://www.forbes.com/lists/2010/18/global-2000-10_The-Global-2000_Rank.html 2. I want to scrape the top 1 – 1000 companies (There should be 8 columns: Their Rank, Company, Country, Industry, Sales ($bil), Profits ($bil), Assets ($bil), Market Value ($bil) 3. Type in the company name in Google and scrape the URL of the first result (usually their website) 4. Then use that URL to check if their home page contains their facebook fan page by searching for a portion of this url (http://www.facebook.com/, if yes, scrape the entire facebook URL, if no, mark as no) 5. Again, type in company name in Google and check to see if the top 100 search results contain a portion of this URL (www.facebook.com). If so, scrape the entire facebook URL, if no, mark as no. Each of the 1000 companies should have the following information 1.Their Rank 2 Company 3. Country 4. Industry 5. Sales ($bil) 6. Profits ($bil) 7. Assets ($bil) 8. Market Value ($bil) 9.Their website url 10. Their facebook fan page url if they have one Keywords: Web Programming, PHP
| Expired |
More php projects
View AllRelated projects
Search for freelance jobs
"I did not know what to expect at first. But my final impression once I used your site and service is a great one! Simply amazing!
I would recommend this service to any other freelance artists and co workers who are looking to expand their client base."
"The possibility to include all information about my freelance working places in just one website. It means, I don't need to tell my future employer to go to odesk, elance, etc. They can check everything about me in donanza website."




