LinkSolv 8.3 Help Pages and User Guide



Average Agree Weights

1. Average Agree Weights
Average Agree Weights. Choose Y to impute using average agree weights, N to use frequency-specific weights.
Average Agree Weights. Match weights are logarithms of likelihood ratios. Match probabilities are calculated from likelihood ratios which take into account agreements and disagreements on all data fields which are compared in your linkage model. Match probabilities are used to draw multiple imputations of true link status for all candidate record pairs from a Bayesian posterior distribution. In general, an agreement on rare values (say, Age = 99) receives greater weight than an agreement on common values (say, Age = 30). Disagree weights are the same for all data values because the linkage model assumes that the probability of incorrect values is independent of the true data value. Different agree weights can cause biases in linked datasets used for analysis if you are using Age (or any other field in your linkage model) as an analysis variable. People with rare ages are less likely to have false negative matches (missing links) than people with common ages. You can remove this bias by specifying Y for Average Agree Weights.

Confirm Messages

2. Confirm Messages
Confirm Messages. Choose Y to show confirmation messages for your response, N to hide.

Data Provider

3. Data Provider
Data Provider. Choose a data provider for application databases: JET=Access JET, SQL=SQL Server.
Data Provider. Each new project is created for either the Access JET Data Provider or the SQL Server Data Provider, depending on which you specify on the Options Tab. If you specify Access JET as Data Provider, all data tables for the project are built in Access databases (*.mdb) in a local folder with the same name as the project. Multiple databases are created because of size limitations in Access. The default location for the project folder is the same as the default location for new databases (Access Options > General). If you specify SQL Server as Data Provider, all data tables for the project are built on the server in a single database. The server and database must be identified as an ODBC Data Source in a System DSN named STMT_ODBC (see When Using SQL Server). An Access Data Project (*.adp) is created in the default project folder and connected to the specified SQL Server database. You will have to provide information requested by the New Connection dialog.

Debug Messages

4. Debug Messages
Debug Messages.. Choose Y to show debug messages in the Immediate window, N to hide.

False Negatives

5. False Negatives
<TODO>: Insert description text here...

Fast Self Match

6. Fast Self Match
Fast Self Match. Choose Y to test pair (A,B) but skip pair (B,A), N to test both pairs.

Goodness of Fit

7. Goodness of Fit
<TODO>: Insert description text here...

Probability Error

8. Probability Error
  1. Probability Error. Choose a default probability for values reported incorrectly.

Random Sample

9. Random Sample
Random Sample Percent. Choose a sample size (% of A X B) for random samples of record pairs.
Random Sample Percent. Random samples of record pairs from all possible pairs (A x B) are used to estimate the effects of comparisons with tolerances and the effects of comparisons with dependent match fields. Larger sample sizes give more accurate estimates of population statistics. Smaller sample sizes give faster run times. Sample sizes of a few million pairs can be analyzed in a few minutes. You can calculate the size of (A x B) by multiplying the number of records in table A times the number of records in table B.

Random Seed

10. Random Seed
Random Sample Percent. Choose a sample size (% of A x B) for random samples of record pairs.

Reference State

11. Reference State
Reference State. Choose a state for displaying location reference tables.
Authored with help of Dr.Explain