SoundFingerprinting 11.2.0
dotnet add package SoundFingerprinting --version 11.2.0
NuGet\Install-Package SoundFingerprinting -Version 11.2.0
<PackageReference Include="SoundFingerprinting" Version="11.2.0" />
paket add SoundFingerprinting --version 11.2.0
#r "nuget: SoundFingerprinting, 11.2.0"
// Install SoundFingerprinting as a Cake Addin #addin nuget:?package=SoundFingerprinting&version=11.2.0 // Install SoundFingerprinting as a Cake Tool #tool nuget:?package=SoundFingerprinting&version=11.2.0
Audio/Video fingerprinting and recognition in .NET
soundfingerprinting is a C# framework designed for companies, enthusiasts, researchers in the fields of digital signal processing, data mining and audio/video recognition. It implements an efficient algorithm which provides fast insert and retrieval of acoustic and video fingerprints with high precision and recall rate.
Documentation
Full documentation is available on the Wiki page.
Below code snippet shows how to extract acoustic fingerprints from an audio file and later use them as identifiers to recognize unknown audio query. These fingerprints will be stored in a configurable datastore.
private readonly IModelService modelService = new InMemoryModelService(); // store fingerprints in RAM
private readonly IAudioService audioService = new SoundFingerprintingAudioService(); // default audio library
public async Task StoreForLaterRetrieval(string file)
{
var track = new TrackInfo("GBBKS1200164", "Skyfall", "Adele");
// create fingerprints
var avHashes = await FingerprintCommandBuilder.Instance
.BuildFingerprintCommand()
.From(file)
.UsingServices(audioService)
.Hash();
// store hashes in the database for later retrieval
modelService.Insert(track, avHashes);
}
Querying
Once you've inserted the fingerprints into the datastore, later you might want to query the storage in order to recognize the song those samples you have. The origin of query samples may vary: file, URL, microphone, radio tuner, etc. It's up to your application, where you get the samples from.
public async Task<TrackData> GetBestMatchForSong(string file)
{
int secondsToAnalyze = 10; // number of seconds to analyze from query file
int startAtSecond = 0; // start at the begining
// query the underlying database for similar audio sub-fingerprints
var queryResult = await QueryCommandBuilder.Instance.BuildQueryCommand()
.From(file, secondsToAnalyze, startAtSecond)
.UsingServices(modelService, audioService)
.Query();
return queryResult.BestMatch.Track;
}
Fingerprints Storage
The default storage, which comes bundled with soundfingerprinting NuGet package, is a plain in-memory storage, available via <code>InMemoryModelService</code> class. If you plan to use an external persistent storage for fingerprints Emy is the preferred choice. Emy provides a community version which is free for non-commercial use. More about Emy can be found on wiki page.
Supported audio/video formats
Read Supported Media Formats page for details about processing different file formats or realtime streams.
Video fingerprinting support since version 8.0.0
Since v8.0.0
video fingerprinting support has been added. Similarly to audio fingerprinting, video fingerprints are generated from video frames, and used to insert and later query the datastore for exact and similar matches. You can use SoundFingerprinting
to fingerprint either audio or video content or both at the same time. More details about video fingerprinting are available here.
Version Matrix
If you are using FFmpegAudioService
as described in the wiki, follow the below version matrix.
| SoundFingerprinting | SoundFingerprinting.Emy | FFmpeg |
| ---- | ------ |-----|
| 8.x | 8.x | 4.x |
| 9.x | 9.x | 5.x |
| 10.x | 10.x | 6.x |
FAQ
- Can I apply this algorithm for speech recognition purposes?
No. The granularity of one fingerprint is roughly ~1.46 seconds.
- Can the algorithm detect exact query position in resulted track?
Yes.
- Can I use SoundFingerprinting to detect ads in radio streams?
Yes. Actually this is the most frequent use-case where SoundFingerprinting was successfully used.
- How many tracks can I store in
InMemoryModelService
?
100 hours of content with
DefaultFingerprintingConfiguration
will consume ~5GB of RAM.
Get it on NuGet
Install-Package SoundFingerprinting
How it works
Demo
My description of the algorithm alogside with the demo project can be found on CodeProject. The article is from 2011, and may be outdated. The demo project is a Audio File Duplicates Detector. Its latest source code can be found here. Its a WPF MVVM project that uses the algorithm to detect what files are perceptually very similar.
Contribute
If you want to contribute you are welcome to open issues or discuss on issues page. Feel free to contact me for any remarks, ideas, bug reports etc.
License
The framework is provided under MIT license agreement.
© Soundfingerprinting, 2010-2024, sergiu@emysound.com
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- Microsoft.Bcl.AsyncInterfaces (>= 8.0.0)
- Microsoft.Extensions.Logging.Abstractions (>= 8.0.1)
- protobuf-net (>= 2.4.6)
NuGet packages (7)
Showing the top 5 NuGet packages that depend on SoundFingerprinting:
Package | Downloads |
---|---|
SoundFingerprinting.Emy
SoundFingerprinting.Emy is a native C# client for Emy fingerprints storage. It provides ways to insert, query, delete entries from Emy. To learn more about SoundFingerprinting visit github page https://github.com/AddictedCS/soundfingerprinting. To learn more about Emy visit https://www.emysound.com |
|
SoundFingerprinting.Audio.Bass
Un4seen.Bass extension library for SoundFingerprinting framework. SoundFingerprinting default library is designed to read only raw wave files. This extension provides the ability to read all files supported by Bass (including .mp3, .flac, .ogg). Un4seen.Bass is free for non-comercial use. If you intend to use this extension for commercial purposes, please contact http://www.un4seen.com for licensing |
|
SoundFingerprinting.Audio.NAudio
NAudio extension library for SoundFingerprinting framework. This extension provides the ability to read all files supported by NAudio media foundation framework. Since NAudio depends on Windows native calls, this extension can execute only in Windows environment. |
|
SoundFingerprinting.Solr
SoundFingerprinting is a C# framework that implements an efficient algorithm of audio fingerprinting and identification. Designed for developers, enthusiasts, researchers in the fields of audio processing, data mining, digital signal processing. |
|
SoundFingerprinting.SQL
SoundFingerprinting is a C# framework that implements an efficient algorithm of audio fingerprinting and identification. Designed for developers, enthusiasts, researchers in the fields of audio processing, data mining, digital signal processing. |
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
11.2.0 | 415 | 12/19/2024 |
11.1.0 | 733 | 10/29/2024 |
11.0.0 | 335 | 10/3/2024 |
10.7.0 | 731 | 8/8/2024 |
10.6.0 | 266 | 7/9/2024 |
10.4.0 | 340 | 6/5/2024 |
10.3.1 | 284 | 5/17/2024 |
10.3.0 | 148 | 5/16/2024 |
10.0.0 | 869 | 4/5/2024 |
9.5.0 | 227 | 3/27/2024 |
9.4.4 | 872 | 3/2/2024 |
9.4.2 | 409 | 2/27/2024 |
9.4.0 | 317 | 2/26/2024 |
9.4.0-beta1 | 120 | 2/23/2024 |
9.3.0 | 357 | 2/19/2024 |
9.2.0 | 386 | 2/6/2024 |
9.1.0 | 1,043 | 12/28/2023 |
9.0.0 | 1,037 | 11/30/2023 |
9.0.0-beta | 586 | 11/22/2023 |
9.0.0-alpha | 123 | 11/19/2023 |
8.33.0 | 453 | 12/28/2023 |
8.32.0 | 177 | 11/21/2023 |
8.30.0 | 1,126 | 10/25/2023 |
8.28.0 | 1,276 | 9/21/2023 |
8.27.2 | 650 | 9/18/2023 |
8.27.1 | 589 | 9/18/2023 |
8.27.0 | 490 | 9/14/2023 |
8.26.0 | 717 | 9/1/2023 |
8.25.0 | 801 | 8/28/2023 |
8.24.0 | 2,025 | 7/17/2023 |
8.23.0 | 3,799 | 5/15/2023 |
8.22.0 | 4,046 | 4/3/2023 |
8.21.0 | 3,619 | 1/22/2023 |
8.20.0 | 1,196 | 1/12/2023 |
8.19.0 | 2,917 | 12/12/2022 |
8.18.0 | 1,145 | 12/8/2022 |
8.17.0 | 1,177 | 11/30/2022 |
8.16.5 | 1,192 | 11/26/2022 |
8.16.3 | 1,111 | 11/22/2022 |
8.16.2 | 1,128 | 11/16/2022 |
8.16.1 | 1,208 | 11/14/2022 |
8.16.0 | 1,204 | 11/11/2022 |
8.15.0 | 2,673 | 9/20/2022 |
8.14.0 | 1,823 | 8/24/2022 |
8.13.0 | 6,455 | 8/16/2022 |
8.12.1-patch1 | 418 | 12/28/2023 |
8.12.0 | 4,377 | 6/6/2022 |
8.9.1 | 4,367 | 4/27/2022 |
8.9.0 | 1,337 | 4/27/2022 |
8.8.0 | 2,160 | 4/17/2022 |
8.7.0 | 2,469 | 4/6/2022 |
8.6.0 | 2,594 | 3/23/2022 |
8.5.1 | 1,888 | 2/23/2022 |
8.5.0 | 1,402 | 2/18/2022 |
8.4.0 | 1,445 | 2/8/2022 |
8.3.0 | 1,451 | 1/21/2022 |
8.2.0 | 1,638 | 1/6/2022 |
8.1.0 | 1,130 | 12/28/2021 |
8.0.0 | 3,109 | 12/22/2021 |
8.0.0-beta4 | 751 | 12/21/2021 |
8.0.0-beta3 | 826 | 12/8/2021 |
8.0.0-beta2 | 216 | 12/7/2021 |
8.0.0-beta | 229 | 12/7/2021 |
8.0.0-alpha | 228 | 12/6/2021 |
7.17.0 | 3,173 | 11/30/2021 |
7.16.0 | 2,258 | 11/26/2021 |
7.15.0 | 2,930 | 11/9/2021 |
7.14.0 | 1,213 | 11/9/2021 |
7.13.0 | 1,866 | 10/19/2021 |
7.12.0 | 1,280 | 10/13/2021 |
7.10.0 | 3,523 | 6/29/2021 |
7.9.9 | 2,867 | 5/10/2021 |
7.9.6 | 1,383 | 4/30/2021 |
7.9.4 | 1,313 | 4/27/2021 |
7.9.3 | 1,353 | 4/18/2021 |
7.9.0 | 24,349 | 4/5/2021 |
7.8.1 | 2,086 | 3/21/2021 |
7.8.0 | 1,228 | 3/19/2021 |
7.7.0 | 527 | 3/15/2021 |
7.5.5 | 2,068 | 1/25/2021 |
7.5.2 | 1,918 | 1/13/2021 |
7.4.24 | 3,003 | 11/30/2020 |
7.4.19 | 2,617 | 11/6/2020 |
7.4.13 | 5,030 | 8/25/2020 |
7.4.12 | 2,344 | 8/11/2020 |
7.4.11 | 1,410 | 8/4/2020 |
7.4.10 | 1,397 | 7/30/2020 |
7.4.7 | 3,070 | 7/22/2020 |
7.4.6 | 1,404 | 7/20/2020 |
7.4.1 | 3,707 | 6/25/2020 |
7.4.0 | 1,540 | 6/19/2020 |
7.3.3 | 3,228 | 5/12/2020 |
7.3.2 | 2,039 | 5/8/2020 |
7.3.1 | 1,415 | 5/4/2020 |
7.3.0 | 1,445 | 4/25/2020 |
7.2.0 | 2,610 | 3/31/2020 |
7.2.0-beta3 | 1,507 | 3/17/2020 |
7.2.0-beta2 | 1,062 | 3/12/2020 |
7.2.0-beta | 1,221 | 2/27/2020 |
7.1.0 | 2,638 | 2/3/2020 |
7.1.0-beta4 | 1,121 | 12/11/2019 |
7.1.0-beta3 | 1,091 | 11/11/2019 |
7.1.0-beta2 | 514 | 11/5/2019 |
7.1.0-beta | 1,132 | 10/23/2019 |
7.1.0-alpha | 430 | 10/8/2019 |
7.0.0 | 4,593 | 7/1/2019 |
6.3.0 | 2,230 | 5/31/2019 |
6.2.2 | 4,413 | 5/1/2019 |
6.2.1 | 2,224 | 4/5/2019 |
6.2.1-beta4 | 535 | 3/27/2019 |
6.2.1-beta3 | 526 | 3/18/2019 |
6.2.1-beta2 | 536 | 3/17/2019 |
6.2.1-beta1 | 523 | 3/12/2019 |
6.2.0 | 2,615 | 3/4/2019 |
6.1.1 | 1,953 | 2/15/2019 |
6.1.0 | 885 | 2/1/2019 |
6.1.0-beta4 | 612 | 1/14/2019 |
6.1.0-beta3 | 634 | 1/10/2019 |
6.1.0-beta2 | 695 | 1/8/2019 |
6.1.0-beta | 641 | 1/7/2019 |
6.0.0 | 2,734 | 11/6/2018 |
6.0.0-beta | 677 | 10/30/2018 |
5.2.3 | 4,382 | 9/20/2018 |
5.2.2 | 10,570 | 7/4/2018 |
5.2.1 | 2,526 | 5/10/2018 |
5.2.0 | 1,256 | 4/29/2018 |
5.1.0 | 1,302 | 3/20/2018 |
5.0.0 | 1,225 | 3/13/2018 |
5.0.0-alpha | 960 | 3/12/2018 |
4.2.1 | 1,935 | 3/5/2018 |
4.2.0 | 1,742 | 3/1/2018 |
4.1.0 | 2,164 | 12/5/2017 |
4.1.0-alpha | 1,062 | 10/27/2017 |
4.0.0 | 1,620 | 10/23/2017 |
3.2.0 | 1,878 | 10/14/2017 |
3.1.2 | 1,870 | 9/16/2017 |
3.1.0 | 2,491 | 12/15/2016 |
3.0.0 | 1,941 | 12/6/2016 |
3.0.0-beta | 1,218 | 11/29/2016 |
3.0.0-alpha | 1,278 | 11/23/2016 |
2.3.3 | 1,794 | 11/10/2016 |
2.3.1 | 1,777 | 10/29/2016 |
2.3.0 | 1,396 | 10/28/2016 |
2.2.0 | 2,055 | 2/1/2016 |
2.1.0 | 1,255 | 1/16/2016 |
2.0.0 | 1,634 | 1/15/2016 |
2.0.0-alpha | 1,229 | 5/16/2015 |
1.0.1.20018 | 2,141 | 6/1/2014 |
1.0.0.33376 | 2,172 | 4/26/2014 |
0.5.3 | 1,720 | 2/3/2014 |
0.5.2 | 1,311 | 1/26/2014 |
0.5.0.1 | 2,529 | 1/12/2014 |
0.4.2-alpha | 1,248 | 6/16/2013 |
0.4.1-alpha | 1,177 | 6/15/2013 |
0.4.0-alpha | 1,209 | 6/14/2013 |
Version 11.2.0
- Version bump to accomodate SoundFingerprinting.Emy update to 11.2.0
Version 11.1.0
- Adjusting `IRealtimeResultEntryFilter` implementation classes, making sure we adhere to the same principles across both configuration options `ResultEntryFilter` and `OngoingResultEntryFilter`.
- Adding `ChainedRealtimeEntryFilter` which replaces `OngoingRealtimeResultEntryFilter`.
- `TrackMatchLengthEntryFilter` can be used as `TrackCoverageLengthEntryFilter(waitTillCompletion: false)`
- `OngoingRealtimeResultEntryFilter(x, y)` can be chained as new ChainedRealtimeEntryFilter([new TrackRelativeCoverageEntryFilter(x, waitTillCompletion: false), new TrackCoverageLengthEntryFilter(y, waitTillCompletion: false)`])
- `TrackRelativeCoverageLengthEntryFilter` can be used as `TrackRelativeCoverageEntryFilter(waitTillCompletion: true)`.
Version 11.0.0
- Cleanup of the codebase, removing obsolete methods and classes.
- Removed IAdvancedModelService.
- Removed obsolete Query methods from IQueryService, leaving only QueryEfficient method.
- Removed MediaType from QueryConfiguration, as it was not used anymore.
Version 10.4.0
- Added new property MetaFields to AVQueryMatch, to be able to store additional metadata about the match.
Version 10.3.0
- Improved the ability to reconstruct coverage from tone signal matches (silence can be treated as a tone signal).
- Added a fingerprinting flag that allows including silence fingerprints in the generated result set.
Version 10
- Accomodating SoundFingerprinting.Emy upgrade to FFmpeg 6.x
Version 9.5.0
- Dependency upgrade to .NET 8
Version 9.4.2
- Contains an important improvement to QueryCommand, allowing a much faster lookup for the use-cases when the track and the query are very long and almost identical.
- The improvement is rooted in the idea of returning Candidates from the IModelService instead of list of SubFingerprints. This provided knowledge on which hashed fingerprint matched with the query.
- To further improve the API design, methods related to the correct functioning of the QueryCommand were extracted into IQueryService interface (from which IModelService derives).