# Briefing Similarity Scoring Methods of Contextual Embeddings

The main objective in this post is to build a system that:

1. Utilize the sentence embeddings to create a knowledge graph and replicate the human's temporal memory in application.&#x20;
2. Scope for strategies to integrate the sentence embeddings into Large language models' session caches.

**Prior Knowledge**

* **Sentence Embeddings**:&#x20;
  * Represents the entire sentences as dense, fixed-size vectors in a high-dimensional space. The dimensions depend on the dimensions of the transformer's architecture layer.&#x20;
  * Converting tokens to sentence embeddings include applying a strategy, known as `mean pooling` , to the token vector embeddings.
  * The common models used to retrieve sentence embeddings are BERT, RoBerta or dedicated models like SBERT. These models are different to text generative models in that they come without a head (or decoder state) and only encoder is loaded / used.&#x20;
* **Similarity Scoring**: measures how 'close' or 'similar' two sentence embeddings are in vector space. Higher similarity scores imply higher semantic relatedness. This post explores different methods including cosine similarity, dot product and Euclidean distance.

**Dependencies**

{% code overflow="wrap" %}

```
sentence-transformers # Used to load pre-trained LLMs to tokenize and encode the text sequences. In this post, all-distilroberta-v1 was used. 
numpy 
```

{% endcode %}

**Notebook Overview**

* A brief comparison of various similarity scoring methods for vector embeddings, including Cosine Similarity, Manhattan Distance, and more.
* Initialising a base class to generate scores in real-time during streaming conversational chats in the instance where a vector database or self-organizing map is used as a generative model's temporal memory.

{% embed url="<https://gist.github.com/whoamimi/81c1707cafcacfade9c5f6f5b328b11b>" %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://whoamimi.gitbook.io/blog/ai-ml-and-data-science-development/large-language-models/briefing-similarity-scoring-methods-of-contextual-embeddings.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.