Project Title: Company Name Data Normalization and Standardization
I have a list of about 5500 company names.
Such as this:
Fortune Development Sales
Fortune International Group
Fortune International Realty
Fosun Property Holdings
Fountainhead Partners US
Four News Development
FOUR SEASONS HOTEL (Whistler Canada)
Four Seasons Hotel Las Colinas
Fowler, White et al
Many of these names are for companies that are the same company, but i am trying to group and deduplicate the companies and make a single record. So part of this process is to clean up and standardize the names.
I am looking for someone that has done this before with great success.
I would like for a deliverable to be some sort of script that I can run locally on my computer through a command line or integrateable into my ETL workflow which is based on PENTAHOO PDI.
For similar work requirements feel free to email us on email@example.com.
Add a comment