NDDW's data cleansing, analysis and
enrichment services can help you improve the quality of your data.
These services include the aggregation, organization and
cleansing of your data.
These data
cleansing, scrubbing and enrichment services can ensure that your databases -
part and material files, catalog processing files, item information, etc. -
are current, accurate and complete.
Data Cleansing:
Existing data, being derived from many sources, often has no consistent format,
contains duplicate records or items, and may have missing,
incomplete or non-required descriptions. Our data cleansing process fixes misspellings,
inconsistent abbreviations, and other errors.
The data is normalized so that items in a class share a common
unit of measure, e.g. feet, inches and meters are
all converted to one unit of measure. Unit names are also standardized so
that each attribute is labeled consistently, e.g. inch, in., and the
symbol " are all shown as inch.
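As an illustration only, the normalization step described above could be sketched in Python as follows. The alias table, the choice of inch as the target unit, and the function name `normalize_length` are assumptions made for this example, not a description of any particular tool.

```python
import re

# Hypothetical alias table: every spelling of a unit maps to one canonical name.
UNIT_ALIASES = {
    "inch": "inch", "inches": "inch", "in": "inch", "in.": "inch", '"': "inch",
    "foot": "foot", "feet": "foot", "ft": "foot", "ft.": "foot", "'": "foot",
    "meter": "meter", "meters": "meter", "m": "meter",
}

# Conversion factors to the unit chosen for the class (inch is an assumption here).
# One inch is defined as exactly 0.0254 meter.
TO_INCH = {"inch": 1.0, "foot": 12.0, "meter": 1 / 0.0254}

# A number followed by a unit name or symbol, e.g. '2 ft', '12 in.' or '24"'.
_MEASURE = re.compile(r"""^\s*(\d+(?:\.\d+)?)\s*([A-Za-z.'"]+)\s*$""")


def normalize_length(raw: str) -> tuple[float, str]:
    """Parse a measurement string and return its value converted to inches."""
    match = _MEASURE.match(raw)
    if match is None:
        raise ValueError(f"unrecognized measurement: {raw!r}")
    value, unit = float(match.group(1)), match.group(2).lower()
    canonical = UNIT_ALIASES.get(unit)
    if canonical is None:
        raise ValueError(f"unknown unit: {unit!r}")
    return round(value * TO_INCH[canonical], 4), "inch"
```

With this sketch, `normalize_length('2 ft')` and `normalize_length('24"')` both return `(24.0, 'inch')`, so the two spellings end up with one value and one unit name.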
Data scrubbing is the process of amending or
removing data in a given database that is incorrect, incomplete, improperly
formatted or duplicated. An organization in a data-intensive field like
banking, insurance, telecommunications, retailing or transportation might
use a data scrubbing tool to systematically examine its data for flaws by using
rules, algorithms, and look-up tables.
Typically, a database scrubbing tool
includes programs that are capable of correcting a number of specific types
of mistakes, such as adding missing zip codes or finding duplicate records.
Using a scrubbing tool can save a database administrator a significant
amount of time and can be less costly than fixing errors manually.
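To make the idea of rules and look-up tables concrete, here is a minimal Python sketch of the two mistakes mentioned above: a missing zip code and a duplicate record. The field names, the tiny `ZIP_BY_CITY` table and both function names are hypothetical. A real tool would rely on a postal authority's data, since one city usually has many zip codes.

```python
from collections import defaultdict

# Hypothetical look-up table with sample entries; one zip per city is a simplification.
ZIP_BY_CITY = {("springfield", "il"): "62701", ("albany", "ny"): "12207"}


def fill_missing_zip(record: dict) -> dict:
    """Return a copy of the record with an empty zip filled from the table."""
    fixed = dict(record)
    if not fixed.get("zip"):
        key = (
            fixed.get("city", "").strip().lower(),
            fixed.get("state", "").strip().lower(),
        )
        # Leave the zip empty when the table has no entry for this city.
        fixed["zip"] = ZIP_BY_CITY.get(key, "")
    return fixed


def find_duplicates(records: list[dict]) -> list[list[int]]:
    """Group the indexes of records that share a normalized name and zip."""
    groups = defaultdict(list)
    for index, record in enumerate(records):
        # Ignore case and repeated spaces when comparing names.
        name = " ".join(record.get("name", "").lower().split())
        groups[(name, record.get("zip", ""))].append(index)
    return [indexes for indexes in groups.values() if len(indexes) > 1]
```

Running `fill_missing_zip` first matters in this sketch: two records for the same customer only fall into one duplicate group once both carry the same zip code.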
The issues involved in data cleansing,
formatting, converting and preparing data for upload are so time-consuming
and so exacting that it makes sense to outsource
select components of the project to an established firm with
extensive experience in data migration.
Our capabilities for cleansing/scrubbing and enrichment services
include:
- Aggregation, organization, and cleansing
- Enrichment with product attributes, images, and manufacturer
  specifications
- De-duplication: elimination of duplicate records, including
  near-duplicates that only look similar
- Identification of missing or incomplete data
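The de-duplication of similar-looking records listed above can be approximated with fuzzy string matching. The sketch below uses Python's standard `difflib` module. The function names and the 0.85 threshold are assumptions chosen for illustration, and other similarity measures would work equally well.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    kept = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in text.lower())
    return " ".join(kept.split())


def similar_pairs(
    descriptions: list[str], threshold: float = 0.85
) -> list[tuple[int, int, float]]:
    """Return index pairs whose normalized text is at least `threshold` similar."""
    cleaned = [normalize(d) for d in descriptions]
    pairs = []
    for (i, a), (j, b) in combinations(enumerate(cleaned), 2):
        score = SequenceMatcher(None, a, b).ratio()
        if score >= threshold:
            pairs.append((i, j, round(score, 2)))
    return pairs
```

Comparing every pair is quadratic in the number of records, so this approach suits small files. Larger catalogs would normally be grouped first (for example by item class) so that only records within a group are compared.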
Our company offers a data cleansing service to clean and tidy up your data. Depending upon
your requirements, this may involve:
- The identification and removal of duplicated records
- The identification and tagging of similar records for subsequent
  manual review
- The removal of spurious and invalid records
- Data validation (for example, using a postal code checker to confirm
  that addresses are correct)
- The removal of obsolete data
- The comparison and removal of records matching third-party
  information, such as opt-in and opt-out lists
- Postal code sorts and verification - We ensure that data complies
  with the rules of Canada Post, the US Postal Service, or the postal
  authority in your native country regarding postal/zip codes and the
  correct spelling of street names, and that it is updated based on
  National Change of Address data (available from postal authorities).
- Genderization - We add or audit for gender-specific
  consistency (e.g. Mr. James Diamond, not Miss James Diamond).
- Upper/Lower Case Conversion - Conversion of data to upper or lower
  case as per your requirement. We take into account the punctuation
  norms of different languages, such as accents in French, and pay
  attention to specifics like 'McDonald' or 'O'Reilly'.
- Miscellaneous Services - Abbreviation expansion, currency
  conversions, data integrity audits (so that fields that should contain
  numbers don't have text in them), de-duping, and data appending based on
  other files (often a client-supplied house file, master client list,
  product list, price list, etc.)
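Three of the checks in the list above lend themselves to a short Python sketch: postal code validation, name-aware case conversion, and a data integrity audit for numeric fields. All function names here are hypothetical, and the patterns test only the shape of a code (five digits with an optional four-digit extension for US zip codes, letter-digit-letter digit-letter-digit for Canadian postal codes). Confirming that a code is actually in use requires the postal authority's own data.

```python
import re

# Shape-only patterns; they do not confirm that a code is actually assigned.
US_ZIP = re.compile(r"^\d{5}(-\d{4})?$")
CA_POSTAL = re.compile(r"^[A-Z]\d[A-Z] ?\d[A-Z]\d$")


def postal_code_ok(code: str, country: str) -> bool:
    """Check that a postal code has the expected shape for 'US' or 'CA'."""
    pattern = {"US": US_ZIP, "CA": CA_POSTAL}.get(country.upper())
    return bool(pattern and pattern.match(code.strip().upper()))


def smart_title(name: str) -> str:
    """Title-case a personal name, keeping capitals after 'Mc' and apostrophes."""
    def fix(word: str) -> str:
        word = word.capitalize()
        if word.startswith("Mc") and len(word) > 2:
            word = "Mc" + word[2:].capitalize()
        # Capitalize the letter after an apostrophe: o'reilly -> O'Reilly.
        return re.sub(r"'(\w)", lambda m: "'" + m.group(1).upper(), word)
    return " ".join(fix(w) for w in name.split())


def non_numeric_rows(rows: list[dict], field: str) -> list[int]:
    """Return the indexes of rows whose `field` cannot be parsed as a number."""
    bad = []
    for index, row in enumerate(rows):
        try:
            float(row.get(field, ""))
        except (TypeError, ValueError):
            bad.append(index)
    return bad
```

The casing rules are meant for personal names only. Applied to ordinary text, the apostrophe rule would wrongly capitalize contractions, which is one reason such conversions are usually reviewed against the client's requirements.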