| |
The people behind the Katrina Data Project aren't trained & experienced
rescue workers... but we do have experience and expertise in electronic data
storage, standardization, sorting, searching, and matching. We want to use this
knowledge to help you however we can.Here are some of the skills members of our
team have, and want to put at your disposal for managing your own data:
-
Data Aggregation from Multiple Sources and Formats
We have already started the process of contacting
the numerous websites which host "Survivor Registries", "Safe Lists", and other
means of re-connecting disaster victims, and are bringing all their data into
one common format and location.
We can also bring in data collected by your relief organization and collate it
with other data we have collected, making a central source for dissemination
information back to you and other orgs, as well as a create central the public
can rely on to have the most complete data available.
-
Notification of Individuals Based on Data Cross-Referencing
We have created the infrastructure and software to
automatically notify (via email) people who have active "searches".
As we bring your MSP data into the project, it will be cross referenced with
existing data, and searchers already in the project who match safe persons in
your data will be notified. In addition, if we receiver searcher data from you
we can notify them of the status of any individuals who have been entered into
the Project from other data sources.
Finally, if you have data on both searchers and safe persons, but no method in
place to match or notify them, we can provide this functionality for you.
-
Data Normalization & Cleaning
After gathering data from various sources, we
abstract certain elements of it to aid in organization, searching, and sorting.
If your organization has "dirty" data, we can bring it into a more usable
standard format for you and assist you in utilizing it effectively.
-
Address Standardization
One of the most important parts of cross-referencing
and matching contact data is the standardization of addresses.
Donated services from Intelligent Search Technology and the experience of the
KDP Team allow us to "clean" addresses into the a standard format, turning
text-data typed by users into data elements which make matching and cross
referencing more accurate and effective. This allows the Data Project to be an
accurate, easy-to-use resource for matching searchers to safe persons.
-
"Fuzzy" Demographic Matching
We have developed algorithms which allows us to
generate high match rates while maintaining a high degree of match confidence.
Our cross-referencing system makes searching much easier on public users by
automatically performing partial matches across multiple possible field
mappings. Instead of trying to search 50 times, the user enters their criteria
once, and the system performs 50 searches for them, bringing back the most
relevant results.
This same system is applied to bulk data as it is loaded into the project.
-
Record De-Duplication
Another critical aspect of data management is the
removal of duplicate records... without the loss of valid records. Our team has
experience dealing with large contact and demographic data sets merged from
multiple sources where duplication is a given, and merging together duplicate
records to ensure the most complete set of data for each record is created and
retained.
-
Mobile Data Management
Our team has already begun work on a mobile
application to access The Project's data which can be accessed from any
internet-enabled phone or PDA.

|
|