Biomatch.Domain
0.31.0
dotnet add package Biomatch.Domain --version 0.31.0
NuGet\Install-Package Biomatch.Domain -Version 0.31.0
<PackageReference Include="Biomatch.Domain" Version="0.31.0" />
paket add Biomatch.Domain --version 0.31.0
#r "nuget: Biomatch.Domain, 0.31.0"
// Install Biomatch.Domain as a Cake Addin
#addin nuget:?package=Biomatch.Domain&version=0.31.0
// Install Biomatch.Domain as a Cake Tool
#tool nuget:?package=Biomatch.Domain&version=0.31.0
Biomatch
A system for matching records using demographic data.
CLI Installation
To build the project, you need the .NET 8 SDK, which is available from the official .NET download page.
Build the CLI tool with the following command:
dotnet publish -c Release src/Biomatch.CLI
This creates a "matching" executable file for your current architecture. To build for another architecture, pass the -r parameter, as in the following example:
dotnet publish -c Release -r linux-x64 src/Biomatch.CLI -o biomatch
For more information on the -r parameter, see the dotnet publish documentation. For a list of supported runtimes, see the dotnet RID catalog.
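To build for several architectures in one pass, the publish command above can be wrapped in a loop. The following is a minimal sketch, not part of the project: the RID list, the `publish_all` function name, the `DRY_RUN` switch, and the `biomatch-<rid>` output folder naming are all assumptions chosen for illustration.

```shell
#!/bin/sh
# Sketch: publish the CLI for several runtime identifiers (RIDs).
# The RID list and output folder names below are illustrative assumptions.
RIDS="linux-x64 win-x64 osx-arm64"

publish_all() {
  for rid in $RIDS; do
    # DRY_RUN=1 (the default) only prints each command; set DRY_RUN=0 to run it.
    if [ "${DRY_RUN:-1}" = "1" ]; then
      echo "dotnet publish -c Release -r $rid src/Biomatch.CLI -o biomatch-$rid"
    else
      dotnet publish -c Release -r "$rid" src/Biomatch.CLI -o "biomatch-$rid"
    fi
  done
}

publish_all
```

Run it from the repository root. The dry-run default makes it easy to review the commands before starting a full build.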
CLI Usage
The CLI tool has a help command that lists all available commands and their parameters.
biomatch --help
Template
The template command creates a template file for the matching engine. The template is a UTF-8 encoded CSV file that most matching commands take as input. It has the following columns:
Column Name | Type | Description |
---|---|---|
RecordId | string | The unique id of the record in the source. |
FirstName | string | First name of person. |
MiddleName | string | Middle name of person. |
LastName | string | Last name of person. |
SecondLastName | string | Second last name of person. |
Birthdate | DateTime | Birth date of person. |
City | string | Physical residence city of person. |
PhoneNumber | string | Primary phone number of person. |
biomatch template generate -o <output file path>
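For a quick look at what a filled-in template might contain, the sketch below writes a small sample file by hand. It rests on several assumptions that the column table does not confirm: a comma delimiter, a header row, columns in the order listed, and ISO 8601 dates for Birthdate. The file name and all record values are made up; the file produced by `biomatch template generate` remains the authoritative layout.

```shell
#!/bin/sh
# Sketch: write a two-record sample file using the documented column names.
# Assumptions: comma delimiter, header row, column order as in the table,
# and ISO 8601 dates. All record values are invented for illustration.
cat > sample_template.csv <<'EOF'
RecordId,FirstName,MiddleName,LastName,SecondLastName,Birthdate,City,PhoneNumber
1,Maria,Isabel,Rivera,Santiago,1985-04-12,San Juan,787-555-0101
2,Jose,,Ortiz,,1990-11-03,Ponce,787-555-0102
EOF

# Sanity check: every row should have as many fields as the header (8).
# This simple check does not handle quoted fields that contain commas.
awk -F',' 'NF != 8 { bad = 1; print "line " NR ": " NF " fields" } END { exit bad }' \
  sample_template.csv && echo "all rows have 8 fields"
```

Empty values, such as a missing middle name, are left blank between delimiters so that each row keeps the same number of fields.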
Find matches between 2 files
Create a template or use an existing one (see the Template section for more information).
biomatch template generate -o <output file path>
Create a word frequency dictionary based on sample data to improve matching accuracy.
biomatch dictionary generate <templateFilePath> -o <output folder path>
Find matches between 2 files with a score threshold of 0.85. The following command assumes that the dictionary has already been generated and is located in the same directory as the executable.
biomatch find matches <templateFilePath1> <templateFilePath2> -o <output file path> --score 0.85
This will generate a file with all matches found and scores for each one. The higher the score, the more likely the records match.
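The dictionary and matching steps above can be chained in a small wrapper. This is a sketch under stated assumptions: `run_matching`, the `BIOMATCH` variable, and all file names are hypothetical, and building the dictionary from the first file is an arbitrary choice of sample data. Only the subcommands and flags shown in this section are used.

```shell
#!/bin/sh
# Sketch of the matching workflow using only the commands documented above.
# BIOMATCH, run_matching, and every path are hypothetical names for illustration.
BIOMATCH="${BIOMATCH:-biomatch}"

run_matching() {
  file1="$1"     # first filled-in template file
  file2="$2"     # second filled-in template file
  output="$3"    # where the matches file is written
  dict_dir="$4"  # dictionary folder (the docs assume the executable's directory)

  # Build the word frequency dictionary; stop if this step fails.
  "$BIOMATCH" dictionary generate "$file1" -o "$dict_dir" || return 1

  # Find matches with the 0.85 score threshold used in the example above.
  "$BIOMATCH" find matches "$file1" "$file2" -o "$output" --score 0.85
}

# Example call (uncomment once both template files are filled in):
# run_matching records_a.csv records_b.csv matches.csv "$(dirname "$(command -v biomatch)")"
```

Because the matching command expects the dictionary next to the executable, the example call passes the executable's directory as the dictionary folder.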
Find duplicates between 2 files
Create a template or use an existing one (see the Template section for more information).
biomatch template generate -o <output file path>
Create a word frequency dictionary based on sample data to improve matching accuracy.
biomatch dictionary generate <templateFilePath> -o <output folder path>
Find duplicates between 2 files with a score threshold of 0.85. The following command assumes that the dictionary has already been generated and is located in the same directory as the executable.
biomatch find duplicates <templateFilePath1> <templateFilePath2> -o <output file path> --score 0.85
This will generate a file with all duplicates found and scores for each one. The higher the score, the more likely the records are duplicates.
Product | Versions |
---|---|
.NET | net9.0 is compatible. |
- net9.0
  - libphonenumber-csharp (>= 8.13.50)
  - Quickenshtein (>= 1.5.1)
  - SymSpell (>= 6.7.2)
  - System.Private.Uri (>= 4.3.2)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
0.31.0 | 85 | 12/1/2024 |
0.30.0 | 387 | 7/18/2024 |
0.29.0 | 327 | 3/8/2024 |
0.28.0 | 115 | 2/29/2024 |
0.27.0 | 144 | 2/14/2024 |
0.25.0 | 427 | 11/21/2023 |
0.24.1 | 129 | 11/16/2023 |
0.24.0 | 129 | 11/16/2023 |
0.23.0 | 148 | 10/12/2023 |
0.22.0 | 150 | 9/28/2023 |
0.21.0 | 146 | 9/7/2023 |
0.20.0 | 136 | 9/5/2023 |
0.19.0 | 200 | 8/15/2023 |
0.18.0 | 169 | 7/11/2023 |
0.14.0 | 183 | 6/15/2023 |
0.13.0 | 140 | 5/16/2023 |
0.12.0 | 143 | 5/16/2023 |
0.11.0 | 164 | 5/3/2023 |
0.10.0 | 174 | 4/18/2023 |
0.8.0 | 197 | 3/29/2023 |
0.7.0 | 204 | 3/20/2023 |