Case Studies Of Web Data Extraction / Data Cleansing

Industries Served: Financial Services | Life Sciences | Hospitality | CPG | Industrials | PE

Collecting Data From Various Firmographic Sources

An industry leader in value-selling automation was launching a new product that provided Company Insider Briefs to its customers. HDG was hired to pull data from various firmographic sources and to systematically generate insights, financial analyses, competitive peer analyses, and related outputs for over 82 industries.

Web Extraction Exercise For Various Business Locations

For a PE-acquired company, HDG performed an extensive web extraction exercise covering over 2,000 business locations. The information was scraped from the company websites and from social media sites such as Yelp and Google. Data elements such as review content and review counts, website addresses, and contact information were extracted using custom algorithms developed in-house.
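
As a rough illustration of this kind of extraction, the sketch below pulls contact details and a review count out of a single page using only the Python standard library. It is not HDG's in-house algorithm; the function name `extract_contact_info`, the sample page, and the regular expressions (which assume North American phone formats and a literal "N reviews" label) are all hypothetical.

```python
import re
from html.parser import HTMLParser

# Illustrative patterns only (assumptions, not HDG's rules).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\(?\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4}")
REVIEW_COUNT_RE = re.compile(r"(\d[\d,]*)\s+reviews?\b", re.IGNORECASE)


class TextAndLinks(HTMLParser):
    """Collects visible text fragments and the href of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        self.text_parts.append(data)


def extract_contact_info(html):
    """Return emails, phones, website links, and a review count from one page."""
    parser = TextAndLinks()
    parser.feed(html)
    text = " ".join(parser.text_parts)
    match = REVIEW_COUNT_RE.search(text)
    return {
        "emails": sorted(set(EMAIL_RE.findall(text))),
        "phones": sorted(set(PHONE_RE.findall(text))),
        # Keep only web links; mailto: and relative links are skipped.
        "websites": [u for u in parser.links if u.startswith(("http://", "https://"))],
        "review_count": int(match.group(1).replace(",", "")) if match else None,
    }


# Hypothetical sample page standing in for a scraped business listing.
SAMPLE_HTML = """
<html><body>
<h1>Example Location</h1>
<p>Call us at (555) 010-1234 or email info@example.com.</p>
<a href="https://www.example.com">Website</a>
<a href="mailto:info@example.com">Email</a>
<span>128 reviews</span>
</body></html>
"""

result = extract_contact_info(SAMPLE_HTML)
print(result)
```

In practice each site lays out its pages differently, so a real exercise across thousands of locations would likely need per-site rules rather than one generic set of patterns.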

Extracting Details From PDFs

HDG was recently retained to extract predetermined information from over 9,000 PDFs. The originals were in French. HDG first translated the PDFs into English, then developed custom algorithms to extract the necessary data and collate it in Excel.
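
The extract-and-collate step could look roughly like the sketch below. It assumes the PDF text has already been extracted and translated into English, and it writes CSV (which Excel opens directly) rather than a native workbook. The field labels (`Reference`, `Date`, `Amount`), the file names, and the helpers `extract_fields` and `collate_to_csv` are hypothetical and stand in for whatever predetermined fields a given project defines.

```python
import csv
import io
import re

# Hypothetical field patterns; the real predetermined fields are not stated in the case study.
FIELD_PATTERNS = {
    "reference": re.compile(r"Reference:\s*(\S+)"),
    "date": re.compile(r"Date:\s*(\d{4}-\d{2}-\d{2})"),
    "amount": re.compile(r"Amount:\s*([\d.,]+)"),
}


def extract_fields(text):
    """Return one row of field values; a missing field becomes an empty string."""
    row = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        row[name] = match.group(1) if match else ""
    return row


def collate_to_csv(documents):
    """Build CSV text with one row per document, keyed by its source file name."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["source", *FIELD_PATTERNS], lineterminator="\n"
    )
    writer.writeheader()
    for source, text in documents.items():
        writer.writerow({"source": source, **extract_fields(text)})
    return buffer.getvalue()


# Hypothetical translated text for two documents; the second lacks a date.
DOCS = {
    "doc_001.pdf": "Reference: AB-1001\nDate: 2021-03-15\nAmount: 1,250.00\n",
    "doc_002.pdf": "Reference: AB-1002\nAmount: 980.50\n",
}

csv_text = collate_to_csv(DOCS)
print(csv_text)
```

Leaving missing fields blank, rather than dropping the row, keeps every source document visible in the spreadsheet so gaps can be reviewed by hand.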