Microsoft.KernelMemory.SemanticKernelPlugin 0.34.240313.1

Prefix Reserved

This package has a SemVer 2.0.0 package version: 0.34.240313.1+2fe693a.

There is a newer version of this package available.
See the version list below for details.

dotnet add package Microsoft.KernelMemory.SemanticKernelPlugin --version 0.34.240313.1

NuGet\Install-Package Microsoft.KernelMemory.SemanticKernelPlugin -Version 0.34.240313.1

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="Microsoft.KernelMemory.SemanticKernelPlugin" Version="0.34.240313.1" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

paket add Microsoft.KernelMemory.SemanticKernelPlugin --version 0.34.240313.1

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: Microsoft.KernelMemory.SemanticKernelPlugin, 0.34.240313.1"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

// Install Microsoft.KernelMemory.SemanticKernelPlugin as a Cake Addin
#addin nuget:?package=Microsoft.KernelMemory.SemanticKernelPlugin&version=0.34.240313.1

// Install Microsoft.KernelMemory.SemanticKernelPlugin as a Cake Tool
#tool nuget:?package=Microsoft.KernelMemory.SemanticKernelPlugin&version=0.34.240313.1

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Kernel Memory

Kernel Memory (KM) is a multi-modal AI Service specialized in the efficient indexing of datasets through custom continuous data hybrid pipelines, with support for Retrieval Augmented Generation (RAG), synthetic memory, prompt engineering, and custom semantic memory processing.

KM includes a GPT Plugin, web clients, a .NET library for embedded applications, and as a Docker container.

Utilizing advanced embeddings and LLMs, the system enables Natural Language querying for obtaining answers from the indexed data, complete with citations and links to the original sources.

Designed for seamless integration as a Plugin with Semantic Kernel, Microsoft Copilot and ChatGPT, Kernel Memory enhances data-driven features in applications built for most popular AI platforms.

Repository Guidance

This repository presents best practices and a reference architecture for memory in specific AI and LLMs application scenarios. Please note that the provided code serves as a demonstration and is not an officially supported Microsoft offering.

Kernel Memory (KM) and Semantic Memory (SM)

Semantic Memory (SM) is a library for C#, Python, and Java that wraps direct calls to databases and supports vector search. It was developed as part of the Semantic Kernel (SK) project and serves as the first public iteration of long-term memory. The core library is maintained in three languages, while the list of supported storage engines (known as "connectors") varies across languages.

Kernel Memory (KM) is a service built on the feedback received and lessons learned from developing Semantic Kernel (SK) and Semantic Memory (SM). It provides several features that would otherwise have to be developed manually, such as storing files, extracting text from files, providing a framework to secure users' data, etc. The KM codebase is entirely in .NET, which eliminates the need to write and maintain features in multiple languages. As a service, KM can be used from any language, tool, or platform, e.g. browser extensions and ChatGPT assistants.

Here's a few notable differences:

Feature	Semantic Memory	Kernel Memory
Data formats	Text only	Web pages, PDF, Images, Word, PowerPoint, Excel, Markdown, Text, JSON, more being added
Search	Cosine similarity	Cosine similarity, Hybrid search with filters, AND/OR conditions
Language support	C#, Python, Java	Any language, command line tools, browser extensions, low-code/no-code apps, chatbots, assistants, etc.
Storage engines	Azure AI Search, Chroma, DuckDB, Kusto, Milvus, MongoDB, Pinecone, Postgres, Qdrant, Redis, SQLite, Weaviate	Azure AI Search, Elasticsearch, Postgres, Qdrant, Redis, SQL Server, In memory KNN, On disk KNN. In progress: Chroma

and features available only in Kernel Memory:

RAG (Retrieval Augmented Generation)
RAG sources lookup
Summarization
Security Filters (filter memory by users and groups)
Long running ingestion, large documents, with retry logic and durable queues
Custom tokenization
Document storage
OCR via Azure Document Intelligence
LLMs (Large Language Models) with dedicated tokenization
Cloud deployment
OpenAPI
Custom storage schema (partially implemented/work in progress)
Short Term Memory (partially implemented/work in progress)
Concurrent write to multiple vector DBs

Supported Data formats and Backends

📝 MS Office: Word, Excel, PowerPoint
📃 PDF documents
🌐 Fetch web pages and HTML files
🖼️ JPG/PNG/TIFF Images with text via OCR
📄 MarkDown and Raw plain text
💻 JSON files
💡 AI: Azure OpenAI, OpenAI, LLama - thanks to llama.cpp and LLamaSharp, Azure Document Intelligence
🧠 Vector storage: Azure AI Search, Postgres+pgvector, Qdrant, MSSQL Server (third party), Elasticsearch (third party), Redis, Chroma (work in progress), KNN vectors in memory (volatile), KNN vectors on disk (persistent).
🗂 Content storage: Azure Blobs, Local file system, In memory content (volatile).
⏳ Orchestration: Azure Queues, RabbitMQ, Local file based queues, In memory queues (volatile).

Kernel Memory in serverless mode

Kernel Memory works and scales at best when running as a service, allowing to ingest thousands of documents and information without blocking your app.

However, you can use Kernel Memory also serverless, embedding the MemoryServerless class in your app.

Importing documents into your Kernel Memory can be as simple as this:

var memory = new KernelMemoryBuilder()
    .WithOpenAIDefaults(Env.Var("OPENAI_API_KEY"))
    .Build<MemoryServerless>();

// Import a file
await memory.ImportDocumentAsync("meeting-transcript.docx", tags: new() { { "user", "Blake" } });

// Import multiple files and apply multiple tags
await memory.ImportDocumentAsync(new Document("file001")
    .AddFile("business-plan.docx")
    .AddFile("project-timeline.pdf")
    .AddTag("user", "Blake")
    .AddTag("collection", "business")
    .AddTag("collection", "plans")
    .AddTag("fiscalYear", "2023"));

Asking questions:

var answer1 = await memory.AskAsync("How many people attended the meeting?");

var answer2 = await memory.AskAsync("what's the project timeline?", filter: new MemoryFilter().ByTag("user", "Blake"));

The code leverages the default documents ingestion pipeline:

Extract text: recognize the file format and extract the information
Partition the text in small chunks, to optimize search
Extract embedding using an LLM embedding generator
Save embedding into a vector index such as Azure AI Search, Qdrant or other DBs.

Documents are organized by users, safeguarding their private information. Furthermore, memories can be categorized and structured using tags, enabling efficient search and retrieval through faceted navigation.

Data lineage, citations

All memories and answers are fully correlated to the data provided. When producing an answer, Kernel Memory includes all the information needed to verify its accuracy:

await memory.ImportFileAsync("NASA-news.pdf");

var answer = await memory.AskAsync("Any news from NASA about Orion?");

Console.WriteLine(answer.Result + "/n");

foreach (var x in answer.RelevantSources)
{
    Console.WriteLine($"  * {x.SourceName} -- {x.Partitions.First().LastUpdate:D}");
}

Yes, there is news from NASA about the Orion spacecraft. NASA has invited the media to see a new test version of the Orion spacecraft and the hardware that will be used to recover the capsule and astronauts upon their return from space during the Artemis II mission. The event is scheduled to take place at Naval Base San Diego on Wednesday, August 2, at 11 a.m. PDT. Personnel from NASA, the U.S. Navy, and the U.S. Air Force will be available to speak with the media. Teams are currently conducting tests in the Pacific Ocean to demonstrate and evaluate the processes, procedures, and hardware for recovery operations for crewed Artemis missions. These tests will help prepare the team for Artemis II, which will be NASA's first crewed mission under the Artemis program. The Artemis II crew, consisting of NASA astronauts Reid Wiseman, Victor Glover, and Christina Koch, and Canadian Space Agency astronaut Jeremy Hansen, will participate in recovery testing at sea next year. For more information about the Artemis program, you can visit the NASA website.

NASA-news.pdf -- Tuesday, August 1, 2023

Using Kernel Memory Service

Depending on your scenarios, you might want to run all the code locally inside your process, or remotely through an asynchronous service.

If you're importing small files, and need only C# and can block the process during the import, local-in-process execution can be fine, using the MemoryServerless seen above.

However, if you are in one of these scenarios:

I'd just like a web service to import data and send queries to answer
My app is written in TypeScript, Java, Rust, or some other language
I want to define custom pipelines mixing multiple languages like Python, TypeScript, etc
I'm importing big documents that can require minutes to process, and I don't want to block the user interface
I need memory import to run independently, supporting failures and retry logic

then you can deploy Kernel Memory as a service, plugging in the default handlers, or your custom Python/TypeScript/Java/etc. handlers, and leveraging the asynchronous non-blocking memory encoding process, sending documents and asking questions using the MemoryWebClient.

Here you can find a complete set of instruction about how to run the Kernel Memory service.

Quick test using the Docker image

If you want to give the service a quick test, use the following command to start the Kernel Memory Service using OpenAI:

docker run -e OPENAI_API_KEY="..." -it --rm -p 9001:9001 kernelmemory/service

If you prefer using custom settings and services such as Azure OpenAI, Azure Document Intelligence, etc., you should create an appsettings.Development.json file overriding the default values set in appsettings.json, or using the configuration wizard included:

cd service/Service
dotnet run setup

Then run this command to start the Docker image with the configuration just created:

docker run --volume ./appsettings.Development.json:/app/appsettings.Production.json \
     -it --rm -p 9001:9001 kernelmemory/service

To import files using Kernel Memory web service, use `MemoryWebClient`:

#reference clients/WebClient/WebClient.csproj

var memory = new MemoryWebClient("http://127.0.0.1:9001"); // <== URL where the web service is running

// Import a file (default user)
await memory.ImportDocumentAsync("meeting-transcript.docx");

// Import a file specifying a Document ID, User and Tags
await memory.ImportDocumentAsync("business-plan.docx",
    new DocumentDetails("user@some.email", "file001")
        .AddTag("collection", "business")
        .AddTag("collection", "plans")
        .AddTag("fiscalYear", "2023"));

Getting answers via the web service

curl http://127.0.0.1:9001/ask -d'{"query":"Any news from NASA about Orion?"}' -H 'Content-Type: application/json'

{
  "Query": "Any news from NASA about Orion?",
  "Text": "Yes, there is news from NASA about the Orion spacecraft. NASA has invited the media to see a new test version of the Orion spacecraft and the hardware that will be used to recover the capsule and astronauts upon their return from space during the Artemis II mission. The event is scheduled to take place at Naval Base San Diego on August 2nd at 11 a.m. PDT. Personnel from NASA, the U.S. Navy, and the U.S. Air Force will be available to speak with the media. Teams are currently conducting tests in the Pacific Ocean to demonstrate and evaluate the processes, procedures, and hardware for recovery operations for crewed Artemis missions. These tests will help prepare the team for Artemis II, which will be NASA's first crewed mission under the Artemis program. The Artemis II crew, consisting of NASA astronauts Reid Wiseman, Victor Glover, and Christina Koch, and Canadian Space Agency astronaut Jeremy Hansen, will participate in recovery testing at sea next year. For more information about the Artemis program, you can visit the NASA website.",
  "RelevantSources": [
    {
      "Link": "...",
      "SourceContentType": "application/pdf",
      "SourceName": "file5-NASA-news.pdf",
      "Partitions": [
        {
          "Text": "Skip to main content\nJul 28, 2023\nMEDIA ADVISORY M23-095\nNASA Invites Media to See Recovery Craft for\nArtemis Moon Mission\n(/sites/default/ﬁles/thumbnails/image/ksc-20230725-ph-fmx01_0003orig.jpg)\nAboard the USS John P. Murtha, NASA and Department of Defense personnel practice recovery operations for Artemis II in July. A\ncrew module test article is used to help verify the recovery team will be ready to recovery the Artemis II crew and the Orion spacecraft.\nCredits: NASA/Frank Michaux\nMedia are invited to see the new test version of NASA’s Orion spacecraft and the hardware teams will use\nto recover the capsule and astronauts upon their return from space during the Artemis II\n(http://www.nasa.gov/artemis-ii) mission. The event will take place at 11 a.m. PDT on Wednesday, Aug. 2,\nat Naval Base San Diego.\nPersonnel involved in recovery operations from NASA, the U.S. Navy, and the U.S. Air Force will be\navailable to speak with media.\nU.S. media interested in attending must RSVP by 4 p.m., Monday, July 31, to the Naval Base San Diego\nPublic Aﬀairs (mailto:nbsd.pao@us.navy.mil) or 619-556-7359.\nOrion Spacecraft (/exploration/systems/orion/index.html)\nNASA Invites Media to See Recovery Craft for Artemis Moon Miss... https://www.nasa.gov/press-release/nasa-invites-media-to-see-recov...\n1 of 3 7/28/23, 4:51 PMTeams are currently conducting the ﬁrst in a series of tests in the Paciﬁc Ocean to demonstrate and\nevaluate the processes, procedures, and hardware for recovery operations (https://www.nasa.gov\n/exploration/systems/ground/index.html) for crewed Artemis missions. The tests will help prepare the\nteam for Artemis II, NASA’s ﬁrst crewed mission under Artemis that will send four astronauts in Orion\naround the Moon to checkout systems ahead of future lunar missions.\nThe Artemis II crew – NASA astronauts Reid Wiseman, Victor Glover, and Christina Koch, and CSA\n(Canadian Space Agency) astronaut Jeremy Hansen – will participate in recovery testing at sea next year.\nFor more information about Artemis, visit:\nhttps://www.nasa.gov/artemis (https://www.nasa.gov/artemis)\n-end-\nRachel Kraft\nHeadquarters, Washington\n202-358-1100\nrachel.h.kraft@nasa.gov (mailto:rachel.h.kraft@nasa.gov)\nMadison Tuttle\nKennedy Space Center, Florida\n321-298-5868\nmadison.e.tuttle@nasa.gov (mailto:madison.e.tuttle@nasa.gov)\nLast Updated: Jul 28, 2023\nEditor: Claire O’Shea\nTags:  Artemis (/artemisprogram),Ground Systems (http://www.nasa.gov/exploration/systems/ground\n/index.html),Kennedy Space Center (/centers/kennedy/home/index.html),Moon to Mars (/topics/moon-to-\nmars/),Orion Spacecraft (/exploration/systems/orion/index.html)\nNASA Invites Media to See Recovery Craft for Artemis Moon Miss... https://www.nasa.gov/press-release/nasa-invites-media-to-see-recov...\n2 of 3 7/28/23, 4:51 PM",
          "Relevance": 0.8430657,
          "SizeInTokens": 863,
          "LastUpdate": "2023-08-01T08:15:02-07:00"
        }
      ]
    }
  ]
}

You can find a full example here.

Custom memory ingestion pipelines

On the other hand, if you need a custom data pipeline, you can also customize the steps, which will be handled by your custom business logic:

// Memory setup, e.g. how to calculate and where to store embeddings
var memoryBuilder = new KernelMemoryBuilder()
    .WithoutDefaultHandlers()
    .WithOpenAIDefaults(Env.Var("OPENAI_API_KEY"));

var memory = memoryBuilder.Build();

// Plug in custom .NET handlers
memory.Orchestrator.AddHandler<MyHandler1>("step1");
memory.Orchestrator.AddHandler<MyHandler2>("step2");
memory.Orchestrator.AddHandler<MyHandler3>("step3");

// Use the custom handlers with the memory object
await memory.ImportDocumentAsync(
    new Document("mytest001")
        .AddFile("file1.docx")
        .AddFile("file2.pdf"),
    steps: new[] { "step1", "step2", "step3" });

Web API specs

The API schema is available at http://127.0.0.1:9001/swagger/index.html when running the service locally with OpenAPI enabled.

Examples and Tools

Examples

Tools

.NET packages

Microsoft.KernelMemory.WebClient: The web client library, can be used to call a running instance of the Memory web service. .NET Standard 2.0 compatible.
Microsoft.KernelMemory.SemanticKernelPlugin: a Memory plugin for Semantic Kernel, replacing the original Semantic Memory available in SK. .NET Standard 2.0 compatible.
Microsoft.KernelMemory.Abstractions: The internal interfaces and models shared by all packages, used to extend KM to support third party services. .NET Standard 2.0 compatible.
Microsoft.KernelMemory.MemoryDb.AzureAISearch: Memory storage using Azure AI Search.
Microsoft.KernelMemory.MemoryDb.Postgres: Memory storage using PostgreSQL.
Microsoft.KernelMemory.MemoryDb.Qdrant: Memory storage using Qdrant.
Microsoft.KernelMemory.AI.AzureOpenAI: Integration with Azure OpenAI LLMs.
Microsoft.KernelMemory.AI.LlamaSharp: Integration with LLama LLMs.
Microsoft.KernelMemory.AI.OpenAI: Integration with OpenAI LLMs.
Microsoft.KernelMemory.DataFormats.AzureAIDocIntel: Integration with Azure AI Document Intelligence.
Microsoft.KernelMemory.Orchestration.AzureQueues: Ingestion and synthetic memory pipelines via Azure Queue Storage.
Microsoft.KernelMemory.Orchestration.RabbitMQ: Ingestion and synthetic memory pipelines via RabbitMQ.
Microsoft.KernelMemory.ContentStorage.AzureBlobs: Used to store content on Azure Storage Blobs.
Microsoft.KernelMemory.Core: The core library, can be used to build custom pipelines and handlers, and contains a serverless client to use memory in a synchronous way, without the web service. .NET 6+.

Packages for Python, Java and other languages

Kernel Memory service offers a Web API out of the box, including the OpenAPI swagger documentation that you can leverage to test the API and create custom web clients. For instance, after starting the service locally, see http://127.0.0.1:9001/swagger/index.html.

A python package with a Web Client and Semantic Kernel plugin will soon be available. We also welcome PR contributions to support more languages.

Product	Compatible and additional computed target framework versions.
.NET	net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed.
.NET Core	netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed.
.NET Standard	netstandard2.0 is compatible. netstandard2.1 was computed.
.NET Framework	net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed.
MonoAndroid	monoandroid was computed.
MonoMac	monomac was computed.
MonoTouch	monotouch was computed.
Tizen	tizen40 was computed. tizen60 was computed.
Xamarin.iOS	xamarinios was computed.
Xamarin.Mac	xamarinmac was computed.
Xamarin.TVOS	xamarintvos was computed.
Xamarin.WatchOS	xamarinwatchos was computed.

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

.NETStandard 2.0
- Microsoft.KernelMemory.WebClient (>= 0.34.240313.1)
- Microsoft.SemanticKernel.Abstractions (>= 1.6.1)

NuGet packages (1)

Showing the top 1 NuGet packages that depend on Microsoft.KernelMemory.SemanticKernelPlugin:

Package	Downloads
Microsoft.KernelMemory The package contains all the core logic and extensions of Kernel Memory, to index and query any data and documents, using LLM and natural language, tracking sources and showing citations.	2.7K

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last updated
0.93.241118.1	110	11/19/2024
0.92.241112.1	1,170	11/12/2024
0.91.241101.1	5,439	11/1/2024
0.91.241031.1	169	10/31/2024
0.90.241021.1	2,141	10/22/2024
0.90.241020.3	334	10/20/2024
0.80.241017.2	260	10/17/2024
0.79.241014.2	380	10/14/2024
0.79.241014.1	51	10/14/2024
0.78.241007.1	565	10/8/2024
0.78.241005.1	329	10/6/2024
0.77.241004.1	66	10/5/2024
0.76.240930.3	1,207	9/30/2024
0.75.240924.1	596	9/24/2024
0.74.240919.1	852	9/19/2024
0.73.240906.1	15,210	9/7/2024
0.72.240904.1	314	9/5/2024
0.71.240820.1	3,959	8/21/2024
0.70.240803.1	8,724	8/3/2024
0.69.240727.1	2,739	7/27/2024
0.68.240722.1	1,631	7/22/2024
0.68.240716.1	566	7/16/2024
0.67.240712.1	747	7/12/2024
0.66.240709.1	877	7/9/2024
0.65.240620.1	11,374	6/21/2024
0.64.240619.1	125	6/20/2024
0.63.240618.1	1,446	6/18/2024
0.62.240605.1	6,492	6/5/2024
0.62.240604.1	123	6/4/2024
0.61.240524.1	9,339	5/24/2024
0.61.240519.2	2,365	5/19/2024
0.60.240517.1	141	5/18/2024
0.51.240513.2	517	5/13/2024
0.50.240504.7	2,255	5/4/2024
0.50.240504.4	111	5/4/2024
0.50.240504.3	88	5/4/2024
0.50.240502.2	94	5/3/2024
0.40.240501.1	181	5/1/2024
0.39.240427.1	475	4/28/2024
0.38.240425.1	425	4/25/2024
0.38.240423.1	638	4/24/2024
0.37.240420.2	438	4/21/2024
0.36.240416.1	9,816	4/16/2024
0.36.240415.2	316	4/16/2024
0.36.240415.1	142	4/15/2024
0.35.240412.2	205	4/12/2024
0.35.240321.1	2,198	3/21/2024
0.35.240318.1	598	3/18/2024
0.34.240313.1	1,361	3/13/2024
0.33.240312.1	240	3/12/2024
0.32.240308.1	1,050	3/8/2024
0.32.240307.3	531	3/7/2024
0.32.240307.2	106	3/7/2024
0.30.240227.1	1,381	2/28/2024
0.29.240219.2	1,245	2/20/2024
0.28.240212.1	1,405	2/13/2024
0.27.240207.1	650	2/7/2024
0.27.240205.2	1,335	2/6/2024
0.27.240205.1	97	2/5/2024
0.26.240121.1	4,018	1/22/2024
0.26.240116.2	1,260	1/16/2024
0.26.240115.4	464	1/16/2024
0.26.240104.1	1,454	1/5/2024
0.25.240103.1	163	1/4/2024
0.24.231228.5	398	12/29/2023
0.24.231228.4	97	12/29/2023
0.23.231224.1	231	12/24/2023
0.23.231221.1	110	12/22/2023
0.23.231219.1	335	12/20/2023
0.22.231217.1	106	12/18/2023
0.21.231214.1	133	12/15/2023
0.20.231212.1	282	12/13/2023
0.19.231211.1	186	12/11/2023