Skip to content

Commit a25d136

Browse files
authored
Enhance README with Azure embedding model details
Updated sections in README to include detailed recommendations for Azure embedding models, including Azure OpenAI, Azure AI Foundry, and Azure Marketplace. Improved formatting and added example use cases and workflows.
1 parent 7686d10 commit a25d136

1 file changed

Lines changed: 17 additions & 5 deletions

File tree

README.md

Lines changed: 17 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,12 @@ Last updated: 2025-08-20
2121
<details>
2222
<summary><b>Table of Contents</b> (Click to expand)</summary>
2323

24+
- [Recommendations for Alternative & Upcoming Embedding Models in Azure](#recommendations-for-alternative--upcoming-embedding-models-in-azure)
25+
- [Azure OpenAI](#azure-openai)
26+
- [Azure AI Foundry](#azure-ai-foundry)
27+
- [Azure Marketplace](#azure-marketplace)
28+
- [How Azure AI Search Can Help Optimize Embedding Performance](#how-azure-ai-search-can-help-optimize-embedding-performance)
29+
2430
</details>
2531

2632

@@ -46,7 +52,8 @@ Last updated: 2025-08-20
4652
> `Upcoming Models to Watch:`
4753
- **Voyage-3-Large**: Expected to outperform OpenAI v3-large with flexible dimensions and quantization. [voyage-3-large: the new state-of-the-art general-purpose embedding model](https://statics.teams.cdn.office.net/evergreen-assets/safelinks/2/atp-safelinks.html)
4854

49-
> Azure OpenAI
55+
### Azure OpenAI
56+
5057
> - `text-embedding-ada-002`
5158
> - `text-embedding-3-large`
5259
> - `text-embedding-3-small`
@@ -61,7 +68,8 @@ Last updated: 2025-08-20
6168

6269
<img width="1893" height="695" alt="image" src="https://github.com/user-attachments/assets/8d822c1d-04fd-49ba-a514-1a3f74e24cdc" />
6370

64-
> Azure AI Foundry
71+
### Azure AI Foundry
72+
6573
> - `embed-v-4-0`
6674
> - `Cohere-embed-v3-multilingual`
6775
> - `Cohere-embed-v3-english`
@@ -77,6 +85,8 @@ Last updated: 2025-08-20
7785

7886
<img width="1907" height="847" alt="image" src="https://github.com/user-attachments/assets/45b1676c-8c5b-4084-be19-4eeb56e8f961" />
7987

88+
### Azure Marketplace
89+
8090
> [Azure Marketplace](https://azuremarketplace.microsoft.com/en-us/marketplace/apps?search=embedding&page=1):
8191
> - `voyage-3 Embedding Model`
8292
> - `voyage-3-lite Embedding Model`
@@ -103,7 +113,8 @@ Last updated: 2025-08-20
103113
- Handles **large-scale indexing** and **low-latency retrieval**, ideal for enterprise-grade applications with millions of documents.
104114
- Integrates with **hybrid search pipelines**, combining vector and keyword search to improve both precision and recall.
105115

106-
**Example Use Cases**:
116+
> Example Use Cases:
117+
107118
- Intelligent document retrieval for legal or financial archives.
108119
- FAQ matching and chatbot grounding using semantic similarity.
109120
- Product recommendation systems based on user intent embeddings.
@@ -118,7 +129,7 @@ Last updated: 2025-08-20
118129
1. **Choose embedding size wisely**: Smaller embeddings (e.g., 512–1024) reduce latency and storage costs, ideal for mobile or real-time apps.
119130
2. **Batch embedding generation**: Pre-process documents in bulk to reduce API calls and improve throughput.
120131
3. **Use domain-specific models**: Models like `voyage-finance-2` or `voyage-law-2` yield better semantic relevance in specialized contexts.
121-
4. **Monitor vector DB costs**: Larger embeddings increase storage and query costsbalance precision with efficiency.
132+
4. **Monitor vector DB costs**: Larger embeddings increase storage and query costs, balance precision with efficiency.
122133
5. **Leverage hybrid search**: Combine keyword and vector search to handle both exact and fuzzy matches, especially in noisy datasets.
123134
6. **Normalize and deduplicate embeddings**: Ensure consistent vector quality and avoid redundant indexing.
124135

@@ -134,7 +145,8 @@ Last updated: 2025-08-20
134145
- **Query using embedded vectors** to perform semantic matching, enabling intelligent document retrieval, contextual search, and RAG workflows.
135146
- **Automate updates**: Use Azure Functions or Logic Apps to refresh embeddings when documents change, keeping your index up-to-date.
136147

137-
**Example Workflow**:
148+
> Example Workflow:
149+
138150
1. Use `text-embedding-3-large` to embed support tickets.
139151
2. Store vectors in Azure AI Search.
140152
3. Query with user questions to retrieve semantically similar tickets.

0 commit comments

Comments
 (0)