LocalAI.Text.Core
0.7.2
Suggested Alternatives
The owner has unlisted this package.
This could mean that the package is deprecated, has security vulnerabilities or shouldn't be used anymore.
dotnet add package LocalAI.Text.Core --version 0.7.2
NuGet\Install-Package LocalAI.Text.Core -Version 0.7.2
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="LocalAI.Text.Core" Version="0.7.2" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
<PackageVersion Include="LocalAI.Text.Core" Version="0.7.2" />
<PackageReference Include="LocalAI.Text.Core" />
For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
paket add LocalAI.Text.Core --version 0.7.2
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
#r "nuget: LocalAI.Text.Core, 0.7.2"
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
#:package LocalAI.Text.Core@0.7.2
#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.
#addin nuget:?package=LocalAI.Text.Core&version=0.7.2
#tool nuget:?package=LocalAI.Text.Core&version=0.7.2
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
LocalAI.Text.Core
Core text processing infrastructure for LocalAI packages.
Overview
This package provides centralized tokenization and text processing utilities used by LocalAI packages that work with text data (Embedder, Reranker, Translator, etc.).
Features
- Tokenizer Factory: Creates tokenizers from model directories
- Multiple Tokenizer Types: WordPiece, BPE, SentencePiece support
- Vocabulary Loading: JSON and TXT format support
- Batch Encoding: Efficient batch processing with padding
Usage
This is an infrastructure package typically used internally by other LocalAI packages.
using LocalAI.Text;
// Create a tokenizer from model directory
var tokenizer = TokenizerFactory.CreateFromModelDirectory(modelPath);
// Encode text
var encoded = tokenizer.Encode("Hello, world!");
Console.WriteLine($"Tokens: {encoded.InputIds.Length}");
// Decode tokens
var decoded = tokenizer.Decode(encoded.InputIds, skipSpecialTokens: true);
Supported Tokenizers
| Type | Format | Models |
|---|---|---|
| WordPiece | tokenizer.json, vocab.txt | BERT, BGE |
| BPE | tokenizer.json, vocab.json | GPT-2, RoBERTa |
| SentencePiece | tokenizer.model | XLM-R, mBART |
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.
-
net10.0
- LocalAI.Core (>= 0.7.2)
- Microsoft.ML.Tokenizers (>= 2.0.0)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
| Version | Downloads | Last Updated |
|---|