Blog

27
Sep

Company Name Data Normalization and Standardization

Post By admin

Project Title: Company Name Data Normalization and Standardization

Project Description:
Hello,

I have a list of about 5500 company names.

Such as this:
Fortun Insurance
FORTUNATO’S
Fortune
Fortune Development Sales
Fortune International
Fortune International Group
Fortune International Realty
Fortune Intl
Fortune Magazine
FORTUNE REALTY
Fosun Property Holdings
FOUNDATION
Foundation Source
Fountainbleu Aviation
FOUNTAINBLEU Hotel
Fountainhead Partners US
Four News Development
Four Seasons
FOUR SEASONS HOTEL (Whistler Canada)
Four Seasons Hotel Las Colinas
Fowler, White et al

Many of these names are for companies that are the same company, but i am trying to group and deduplicate the companies and make a single record. So part of this process is to clean up and standardize the names.

I am looking for someone that has done this before with great success.

I would like for a deliverable to be some sort of script that I can run locally on my computer through a command line or integrateable into my ETL workflow which is based on PENTAHOO PDI.

For similar work requirements feel free to email us on info@datacleaningservices.com.

  • Facebook
  • Twitter
  • Google Plus
  • Linkedin

Add a comment