FAQs About Address Retrieval and Customer Identification
From what address volume is a tool for address identification meaningful?
As soon as you lose the manual overview by means of simple comparison, duplicates creep in. One loses track after about 500 addresses. By sorting one can possibly keep track of up to about 10,000 addresses. Beyond that a tool for preventing duplicates is indispensable.
What makes mailRetrieval different from name and address searching with "wildcards", which are available as standard in all databases?
-
The retrieval of results is much faster. Even with volumes of more than a million addresses the response time is usually less than a second.
-
Searching with mailRetrieval can be error-tolerant even without wildcards. This is important to mention, because entering wildcards presumes that the user has an idea where a deviation could be!
-
Duplicate check can be carried out automatically without any action by the user for every new entry or change. Hence, even without intensive research by the user the genesis of duplicates is automatically prevented as far as possible.
-
Certain identification ensues for example when
- words are interchanged Franks John instead of John Franks
- words are abbreviated: J. Franks instead of John Franks
- the name sounds different from the way it is spelled: Francks instead of Franks
- there has been a simple data entry error: ranks instead of Franks
-
The results are returned according to the degree of agreement. This facilitates operation considerably and spares endless leafing through result lists.
What is the maximum volume of addresses that can be managed with mailRetrieval?
The maximum technical limit is greater than all of the results that occur in practice. Of course, the address database size to be managed also depends on the available performing resources (CPU and hard disk drive and motherboard). In practice, the volume of addresses stored in the database for real time address identification is not so significant, since this can possibly be distributed to several countries. What is important is the volume of managed addresses per country performing without segmentation.
A practical example for this: some users of mailRetrieval manage address databases with more than 60 million redundancy-free consumer addresses per country.
Do larger address databases have to be segmented for mailRetrieval?
No! Segmentation is a technique, which usually has to be used by systems with simple technologies to achieve an acceptable performance also for very large databases. The big disadvantage of segmentation is that usually between the segments no duplicate recognition occurs. While comparing there are "blind spots" in which a sizable number of unrecognized duplicates remain. For very large international applications it is advisable to divide up according to countries simply for organizational reasons.
What kind of performance does mailRetrieval offer?
The concrete performance depends significantly on the resources (CPU and hard disk drive and motherboard). Because Uniserv has developed special error-tolerant searching methods for address identification in real time, this provides high performance with low demand on resources. Here is a practical example:
The management of 10 million addresses without segmentation on a rather small Intel server with 1 GHz and 500MB RAM leads to an average response for an error-tolerant search of all name and address fields of less than 0.5 seconds!
Does mailRetrieval ensure for the update that the data integrity and the transaction security between the database and the mailRetrieval index structure are guaranteed?
Yes. Here different methods are used depending on the environment it is used in. As a general solution in the client/server environment, mailRetrieval supports the two-phase commit protocol.
Besides in interactive applications, can mailRetrieval also be used in batch applications as well?
Yes. Especially when in comparison to the database size only small amounts of addresses must be integrated and only short reaction times are necessary, the automatic duplicate recognition is used and also the dynamic clustering of addresses in a batch.
Should the automatic duplicate check be run only when entering new addresses or should it also be run for address changes?
It is important, that besides new addresses also address changes are checked for duplicates. Experience shows that with companies that only check new entries a not inconsiderable number of duplicates get into the address database via the change function. Your contact partner at Uniserv responsible for these issues will be glad to give you additional background information regarding this point.
Does mailRetrieval need additional connectivity software in client/server operation?
No. All needed components are contained in the mailRetrieval server and the respective client kit. The Uniserv client/server technology builds directly on the TCP/IP system software present in every operating system. Additional connectivity software is not needed.
Is mailRetrieval also able to identify addresses suspected of being duplicates, if these are each valid but different or similar? (For example turned around numbers in the house number or similar, but respectively valid streets.)
Of course. Since mailRetrieval does not need any segmentation, the software has absolutely no problem with that. This situation occurs with systems which have to implement segmenting for large databases – sometimes as far down as to the household level - in order to achieve an acceptable performance.
Does updating with mailRetrieval always have to be combined with conducting a postal address check?
No, but using these systems together is recommended. Generally, you can use mailRetrieval without running a postal address check. However it is advisable to run a postal address check before entering new addresses into the database or changing addresses to make sure that there are not only no duplicates but that only correct addresses are stored in the database.
Can an error-tolerant search for addresses be undertaken with mailRetrieval via other information than name or address elements?
Of course! Error-tolerant searching is e.g. possible via the telephone number, the e-mail or WEB address, date of birth or any other fields such as the VAT (value added tax) number, the customer number, the credit card number or similar information.
Why is it recommended to check an address database periodically with mailBatch even when it is managed real time with mailRetrieval?
In most applications it is customary that when duplicates are suspected, the user who is entering new addresses or changing them receives a message that can either be accepted or rejected. Through faulty operation or, because final clarification at short notice is not possible, even in such systems the unconscious or willingly accepted entry of duplicate or multiple addresses takes place.
You can track these cases down in a periodical mailBatch run and clarify them asynchronously.