We´re interest in building a very large data base of FinTech startup (all around the world) using scraping tools but we´ve some doubts are what is feasible and what should be needed.
We´d like to scrap different websites, directories, and databases, and find any website where we can obtain different info about the FinTech startup. In some sites, there will be more info and in others the info related to the startups will be limited.
Some of the info we want to scrap is the following: Name of startup, website url, contact e-mail, description, social media url (twitter, facebook, linkedin, youtube, crunchbase), foundation year, hq location (country and city), nº of employees, key words, logo image, video or remarkable images, investors, …
we have the following questions:
• what kind of info could you scrap? can you get some info from a directory of startups (i.e.
Crunchbase) and then other from the website of the startup or other sources?
• Do you need that we tell you the websites to be scrapped and the info needed from each? Or you can obtain it with an auto-sufficient and smart algorithm?
• Can you scrap or create a video from the website or youtube? Can you scrap images?
• Could you do scraping from sources like Crunchbase, Dealroom, Tracxn, or FinTechdb?
Category: IT & Programming
Subcategory: Data Science
Project size: Medium
Is this a project or a position?: Project
Required availability: As needed