Changelog

All notable changes to this project will be documented in this file.
This format follows Keep a Changelog and adheres to Semantic Versioning.

[v2.6.6] - 2026-04-20

Added

Multimodal figure/image extraction for Content Understanding (Azure/GPT-RAG#446): When using Content Understanding as the document analysis backend (USE_DOCUMENT_INTELLIGENCE=false), the multimodal chunker now extracts figures from documents, uploads them to the documents-images blob container, generates captions using a vision-capable model, and populates relatedImages, imageCaptions, and captionVector fields in the search index — achieving full multimodal parity with the Document Intelligence path. Supports PDF (PyMuPDF page rendering with bounding-box crop), DOCX (word/media/ ZIP extraction), and PPTX (ppt/media/ ZIP extraction). The ContentUnderstandingClient now parses and returns figure and page metadata from the API response instead of discarding it. New dependencies: PyMuPDF, python-docx, python-pptx.
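
As an illustration of the DOCX and PPTX path described above, here is a minimal sketch. It is not the ingestion component's actual code. It relies only on the fact that both formats are ZIP archives with embedded images stored under word/media/ and ppt/media/. The function name list_embedded_media and the MEDIA_PREFIXES mapping are hypothetical.

```python
import io
import zipfile

# Media folders named in the entry above; DOCX and PPTX files are ZIP archives.
# The mapping and function name are illustrative, not taken from gpt-rag-ingestion.
MEDIA_PREFIXES = {".docx": "word/media/", ".pptx": "ppt/media/"}


def list_embedded_media(data: bytes, extension: str) -> dict[str, bytes]:
    """Return {archive path: raw bytes} for every file under the media folder."""
    prefix = MEDIA_PREFIXES[extension.lower()]
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        return {
            name: archive.read(name)
            for name in archive.namelist()
            # Skip directory entries, which end with a slash.
            if name.startswith(prefix) and not name.endswith("/")
        }
```

In a real pipeline the returned bytes would presumably be uploaded to the documents-images container and captioned. Both steps are omitted here.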

Changed

Bumped gpt-rag-ingestion to v2.3.3.

Tested Service Versions

Component	Version
gpt-rag-ui	v2.3.1
gpt-rag-orchestrator	v2.6.2
gpt-rag-ingestion	v2.3.3
infra (landing zone)	v1.0.7

[v2.6.5] - 2026-04-18

Fixed

OpenTelemetry version pinning (orchestrator): Pinned azure-monitor-opentelemetry==1.8.7, azure-monitor-opentelemetry-exporter==1.0.0b49, opentelemetry-instrumentation-httpx==0.61b0, and opentelemetry-instrumentation-fastapi==0.61b0 in requirements.txt. Unpinned versions caused non-deterministic Docker builds where an older exporter (referencing the removed LogData class) could be paired with opentelemetry-sdk>=1.39.0, crashing the container on startup with ImportError: cannot import name 'LogData' from 'opentelemetry.sdk._logs'. (#445)
Permission trimming header format (orchestrator): Removed erroneous Bearer prefix from the x-ms-query-source-authorization header value in both the REST API path (search.py) and the SDK path (search_context_provider.py). Azure AI Search expects the raw OBO token without the prefix; including it caused 400 Invalid header errors when permissionFilterOption was enabled on the search index. (#447)

Changed

Bumped gpt-rag-orchestrator to v2.6.2.

[v2.6.4] - 2026-04-14

Fixed

Restored missing parent_id field in the RAG search index template (config/search/search.j2), which was accidentally removed during the v2.6.0 merge. This caused gpt-rag-ingestion blob storage and SharePoint indexers to fail with Could not find a property named 'parent_id' errors.

Changed

Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.7, fixing Log Analytics provisioning failure in Sweden Central caused by forceCmkForQuery default.

[v2.6.3] - 2026-04-08

Changed

Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.6.
Parametrized Container App CPU and memory per app entry with fallback defaults (0.5 CPU / 1.0Gi).
Increased dataingest Container App resources to 1.0 CPU and 3.0Gi memory.
Increased text-embedding-3-large deployment capacity from 40 to 100.
Bumped gpt-rag-ingestion to v2.3.2.
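
The per-app CPU and memory fallback in this release is implemented in the infrastructure templates. The Python sketch below only illustrates the resolution logic. The keys name, cpu, and memory and the function resolve_resources are hypothetical. The default values and the dataingest figures come from the entries above.

```python
# Fallback defaults stated in the changelog entry.
DEFAULT_CPU = 0.5
DEFAULT_MEMORY = "1.0Gi"


def resolve_resources(app: dict) -> dict:
    """Return the CPU and memory for one app entry, applying the defaults."""
    return {
        "cpu": app.get("cpu", DEFAULT_CPU),
        "memory": app.get("memory", DEFAULT_MEMORY),
    }


# Hypothetical app entries: only dataingest overrides the defaults.
apps = [
    {"name": "orchestrator"},
    {"name": "dataingest", "cpu": 1.0, "memory": "3.0Gi"},
]
resolved = {app["name"]: resolve_resources(app) for app in apps}
```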

[v2.6.2] - 2026-04-01

Changed

Bumped gpt-rag-orchestrator to v2.6.1.

[v2.6.1] - 2026-04-01

Fixed

Fixed Zero Trust provisioning failure caused by jumpbox Custom Script Extension using incorrect release tag. Replaced install_script URL field with ailz_tag in manifest.json, allowing the install script URL and release parameter to be derived from the landing zone tag.

Changed

Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.5.
Bumped gpt-rag-ui to v2.3.1.
Bumped gpt-rag-ingestion to v2.2.5.

[v2.6.0] - 2026-03-31

Changed

Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.4.
Bumped gpt-rag-ui to v2.3.0.
Bumped gpt-rag-orchestrator to v2.6.0.
Bumped gpt-rag-ingestion to v2.2.4.
Added explicit partitionKey to all Cosmos DB container definitions, including /principal_id for conversations container.
Added conversation-documents storage container.
Added conversationId filterable field to search index.
Removed standalone MCP Container App from default deployment (consolidated into orchestrator).

[v2.5.3] - 2026-03-24

Changed

Updated default chat model from gpt-5-mini to gpt-5-nano (2025-08-07), increased deployment capacity to 100, and set API version to 2025-12-01-preview.
Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.3.
Bumped gpt-rag-ui to v2.2.3.
Bumped gpt-rag-orchestrator to v2.5.0.

Added

Added repository development and release instructions (.github/copilot-instructions.md).

[v2.5.2] - 2026-03-16

Changed

Updated pre-deployment behavior to skip cloning a component repository when it already exists locally, improving repeat deployment workflows and avoiding unnecessary clone failures. Closes #428.
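
The skip-if-present behavior can be sketched as follows. The actual pre-deployment script may differ. The names ensure_component and default_clone are hypothetical, and the clone step is injected so the logic can be exercised without network access.

```python
import subprocess
from pathlib import Path
from typing import Callable


def default_clone(url: str, target: Path) -> None:
    """Clone a repository with the git CLI (assumes git is on PATH)."""
    subprocess.run(["git", "clone", url, str(target)], check=True)


def ensure_component(
    url: str,
    target: Path,
    clone: Callable[[str, Path], None] = default_clone,
) -> bool:
    """Clone url into target unless target already exists.

    Returns True when a clone was performed and False when it was skipped.
    """
    if target.exists():
        return False
    clone(url, target)
    return True
```

Skipping when the directory exists avoids the failure git reports when asked to clone into a non-empty directory on repeat deployments.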

Fixed

Made virtual environment cleanup in scripts/postProvision.sh non-fatal so post-provisioning continues even if cleanup cannot complete. Closes #426.

[v2.5.1] - 2026-03-06

Changed

Updated infra submodule to external bicep-ptn-aiml-landing-zone tag v1.0.1.
Bumped gpt-rag-orchestrator to v2.4.2.
Bumped gpt-rag-ui to v2.2.2.
Improved runtime performance by upgrading the Orchestrator and UI components to v2.4.2 and v2.2.2, respectively.
Bumped gpt-rag-ingestion to v2.2.3.

[v2.5.0] - 2026-03-02

Changed

Migrated infra folder to external submodule bicep-ptn-aiml-landing-zone pinned to v1.0.0.

[v2.4.2] - 2026-02-04

Fixed

Updated the Docker image to install Microsoft's current public signing key, fixing build failures caused by SHA-1 signature rejection in newer Debian/apt verification policies (orchestrator).
Fixed Docker builds on ARM-based machines by explicitly setting the target platform to linux/amd64, preventing Azure Container Apps deployment failures.

Changed

Updated the Docker base image.
Standardized on the container best practice of using a non-privileged port (8080) instead of a privileged port (80), reducing the risk of runtime/permission friction and improving stability of long-running ingestion workloads.
Bumped aiohttp to 3.13.3.

[v2.4.1] - 2026-01-20

Changed

Bumped ingestion component version to include reliability improvements for large spreadsheet ingestion.

[v2.4.0] - 2026-01-15

Added

Document-level security enforcement for GPT-RAG using Azure AI Search native ACL/RBAC trimming with end-user identity propagation via x-ms-query-source-authorization. Includes permission-aware indexing metadata (userIds/groupIds/rbacScope), safe-by-default behavior for requests without a valid user token, and optional elevated-read debugging support.

Changed

Bumped chat model to gpt-5-mini.

[v2.3.0] - 2025-12-15

Added

Support for SharePoint Lists in the ingestion component.
Refactored Single Agent Strategy to simplify citation handling. [#161]
Simplified MCP Strategy. [#159]

Changed

Improved robustness of Blob Storage indexing in the ingestion pipeline.
Enhanced data ingestion logging for better observability and troubleshooting.

Tested

Compatibility with Azure direct models for inference in the orchestration layer.

[v2.2.6] - 2025-12-05

Fixed

Fixed Issue #409 by updating the main Bicep template to ensure the SEARCH_CONNECTION_ID app setting points to the correct AI Search connection ID. It was previously pointing to the AI Foundry AI Search dependency.

[v2.2.5] - 2025-12-02

Fixed

Fixed Issue #406 by updating networking and private endpoint configuration to prevent the cosmos_vnet_blocked error in Cosmos DB private-only setups.

Changed

Automated the creation and registration of the Azure AI Search connection, removing the need for the previous manual workaround.

[v2.2.4] - 2025-11-26

Fixed

Fixed a bug in the data ingestion component where the Blob Storage ingestion process re-indexed unchanged files when the AI Search index had more than 1,000 chunks. Fixed in gpt-rag-ingestion v2.0.6.

Changed

Updated scripts/postProvision.sh to make the Container Apps API Key check more robust by always converting the USE_CAPP_API_KEY variable to lowercase, even when it is unset.
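
The change itself lives in a shell script. The Python sketch below only illustrates the same idea of normalizing a variable that may be unset. The function name capp_api_key_enabled and the assumption that the value "true" enables the check are hypothetical.

```python
import os
from typing import Mapping, Optional


def capp_api_key_enabled(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when USE_CAPP_API_KEY is set to 'true' in any letter case.

    An unset or empty variable is treated as an empty string before
    lowercasing, so the comparison never fails on a missing value.
    """
    env = os.environ if env is None else env
    return (env.get("USE_CAPP_API_KEY") or "").lower() == "true"
```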

[v2.2.3] - 2025-11-15

Fixed

Fixed intermittent authentication timeouts during AI Foundry post-provisioning setup by increasing the AzureCliCredential and ManagedIdentityCredential process timeout to 30 seconds in config/aifoundry/setup.py.
Restored compatibility with older AZD versions by removing string interpolation syntax from capability host connection arrays in the AI Foundry project module (infra/modules/ai-foundry/modules/project/main.bicep, lines 229-231).

Changed

Suppressed BCP081 warnings for future-dated API versions (2025-01-01, 2025-04-01, 2025-05-01, 2025-06-01) in the AI Foundry project module by adding #disable-next-line directives.
Improved PR and Issue templates.
Moved documentation to https://aka.ms/gpt-rag-docs
Bumped gpt-rag-mcp to v0.2.3.

[v2.2.2] - 2025-11-09

Changed

Updated infra templates to create the data private endpoint for Azure Container Registry when in network isolation mode.
Updated Bastion configuration to retrieve credentials from Key Vault. Users can now simply reset the testvmuser password to access the VM for the first time.

[v2.2.1] - 2025-10-21

Added

Added more troubleshooting logs.

Fixed

Citation fix. #387.

[v2.2.0] - 2025-10-16

Added

Bring your own VNet. #370.
Agentic Retrieval. #359.

Fixed

Citation links opened new chat windows instead of rendering files. #387.

[v2.1.2] - 2025-10-02

Changed

Fixed a bug in the data ingestion component where the SharePoint ingestion process unnecessarily re-indexed unchanged files.

[v2.1.1] - 2025-09-22

Changed

Limited azd environment variables to the script process (no longer persisted to the user profile) to reduce secret exposure. Resolves #378.
Streamlined AI Search provisioning: it now creates only the AI Search index. Previously it also created indexers, skillsets, and data sources that are no longer used and caused confusion about expected runtime behavior. Indexing is performed by the gpt-rag-ingestion jobs; see the ingestion docs for how to run, schedule, or troubleshoot ingest jobs. Resolves #377.

[v2.1.0] - 2025-08-31

Added

User Feedback Loop. #358. Documentation.

Changed

Standardized resource group variable as AZURE_RESOURCE_GROUP. #365

[v2.0.5] - 2025-08-26

Fixed

Resolved VM deployment errors when using CustomScriptExtension under network isolation.

[v2.0.4] - 2025-08-21

Added

Updated orchestrator to version 2.0.3, which includes NL2SQL docs and improved settings checks.

[v2.0.3] - 2025-08-19

Fixed

Resolved issue with using Azure Container Apps under a private endpoint in AI Search as a custom web skill.

Added

Blob Storage Data Source Ingestion.
NL2SQL Metadata Ingestion from Blob Storage.

[v2.0.2] - 2025-08-08

Changed

Updated deployment documentation.

[v2.0.1]

Changed

Updated deployment documentation.

Fixed

Resolved deployment issues introduced in v2.0.0.

[v2.0.0] - 2025-07-15

Changed

Major architecture refactor to support the vNext architecture.
