Skip to content

Latest commit

 

History

History
217 lines (175 loc) · 12.6 KB

File metadata and controls

217 lines (175 loc) · 12.6 KB

Changelog

All notable changes to this project will be documented in this file.
This format follows Keep a Changelog and adheres to Semantic Versioning.

[v2.6.6] - 2026-04-20

Added

  • Multimodal figure/image extraction for Content Understanding (Azure/GPT-RAG#446): When using Content Understanding as the document analysis backend (USE_DOCUMENT_INTELLIGENCE=false), the multimodal chunker now extracts figures from documents, uploads them to the documents-images blob container, generates captions using a vision-capable model, and populates relatedImages, imageCaptions, and captionVector fields in the search index — achieving full multimodal parity with the Document Intelligence path. Supports PDF (PyMuPDF page rendering with bounding-box crop), DOCX (word/media/ ZIP extraction), and PPTX (ppt/media/ ZIP extraction). The ContentUnderstandingClient now parses and returns figure and page metadata from the API response instead of discarding it. New dependencies: PyMuPDF, python-docx, python-pptx.

Changed

  • Bumped gpt-rag-ingestion to v2.3.3.

Tested Service Versions

Component Version
gpt-rag-ui v2.3.1
gpt-rag-orchestrator v2.6.2
gpt-rag-ingestion v2.3.3
infra (landing zone) v1.0.7

[v2.6.5] - 2026-04-18

Fixed

  • OpenTelemetry version pinning (orchestrator): Pinned azure-monitor-opentelemetry==1.8.7, azure-monitor-opentelemetry-exporter==1.0.0b49, opentelemetry-instrumentation-httpx==0.61b0, and opentelemetry-instrumentation-fastapi==0.61b0 in requirements.txt. Unpinned versions caused non-deterministic Docker builds where an older exporter (referencing the removed LogData class) could be paired with opentelemetry-sdk>=1.39.0, crashing the container on startup with ImportError: cannot import name 'LogData' from 'opentelemetry.sdk._logs'. (#445)
  • Permission trimming header format (orchestrator): Removed erroneous Bearer prefix from the x-ms-query-source-authorization header value in both the REST API path (search.py) and the SDK path (search_context_provider.py). Azure AI Search expects the raw OBO token without the prefix; including it caused 400 Invalid header errors when permissionFilterOption was enabled on the search index. (#447)

Changed

  • Bumped gpt-rag-orchestrator to v2.6.2.

[v2.6.4] - 2026-04-14

Fixed

  • Restored missing parent_id field in the RAG search index template (config/search/search.j2), which was accidentally removed during the v2.6.0 merge. This caused gpt-rag-ingestion blob storage and SharePoint indexers to fail with Could not find a property named 'parent_id' errors.

Changed

  • Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.7, fixing Log Analytics provisioning failure in Sweden Central caused by forceCmkForQuery default.

[v2.6.3] - 2026-04-08

Changed

  • Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.6.
  • Parametrized Container App CPU and memory per app entry with fallback defaults (0.5 CPU / 1.0Gi).
  • Increased dataingest Container App resources to 1.0 CPU and 3.0Gi memory.
  • Increased text-embedding-3-large deployment capacity from 40 to 100.
  • Bumped gpt-rag-ingestion to v2.3.2.

[v2.6.2] - 2026-04-01

Changed

  • Bumped gpt-rag-orchestrator to v2.6.1.

[v2.6.1] - 2026-04-01

Fixed

  • Fixed Zero Trust provisioning failure caused by jumpbox Custom Script Extension using incorrect release tag. Replaced install_script URL field with ailz_tag in manifest.json, allowing the install script URL and release parameter to be derived from the landing zone tag.

Changed

[v2.6.0] - 2026-03-31

Changed

  • Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.4.
  • Bumped gpt-rag-ui to v2.3.0.
  • Bumped gpt-rag-orchestrator to v2.6.0.
  • Bumped gpt-rag-ingestion to v2.2.4.
  • Added explicit partitionKey to all Cosmos DB container definitions, including /principal_id for conversations container.
  • Added conversation-documents storage container.
  • Added conversationId filterable field to search index.
  • Removed standalone MCP Container App from default deployment (consolidated into orchestrator).

[v2.5.3] - 2026-03-24

Changed

  • Updated default chat model from gpt-5-mini to gpt-5-nano (2025-08-07), increased deployment capacity to 100, and set API version to 2025-12-01-preview.
  • Updated infra submodule to bicep-ptn-aiml-landing-zone tag v1.0.3.
  • Bumped gpt-rag-ui to v2.2.3.
  • Bumped gpt-rag-orchestrator to v2.5.0.

Added

  • Added repository development and release instructions (.github/copilot-instructions.md).

[v2.5.2] - 2026-03-16

Changed

  • Updated pre-deployment behavior to skip cloning a component repository when it already exists locally, improving repeat deployment workflows and avoiding unnecessary clone failures. Closes #428.

Fixed

  • Made virtual environment cleanup in scripts/postProvision.sh non-fatal so post-provisioning continues even if cleanup cannot complete. Closes #426.

[v2.5.1] - 2026-03-06

Changed

  • Updated infra submodule to external bicep-ptn-aiml-landing-zone tag v1.0.1.
  • Bumped gpt-rag-orchestrator to v2.4.2.
  • Bumped gpt-rag-ui to v2.2.2.
  • Improved runtime performance by upgrading the Orchestrator and UI components to v2.4.2 and v2.2.2, respectively.
  • Bumped gpt-rag-ingestion to v2.2.3.

[v2.5.0] - 2026-03-02

Changed

[v2.4.2] - 2026-02-04

Fixed

  • Updated the Docker image to install Microsoft's current public signing key, fixing build failures caused by SHA-1 signature rejection in newer Debian/apt verification policies (orchestrator).
  • Fixed Docker builds on ARM-based machines by explicitly setting the target platform to linux/amd64, preventing Azure Container Apps deployment failures.

Changed

  • Updated the Docker base image.
  • Standardized on the container best practice of using a non-privileged port (8080) instead of a privileged port (80), reducing the risk of runtime/permission friction and improving stability of long-running ingestion workloads.
  • Bumped aiohttp to 3.13.3.

[v2.4.1] - 2026-01-20

Changed

  • Bumped ingestion component version to include reliability improvements for large spreadsheet ingestion.

[v2.4.0] - 2026-01-15

Added

  • Document-level security enforcement for GPT-RAG using Azure AI Search native ACL/RBAC trimming with end-user identity propagation via x-ms-query-source-authorization. Includes permission-aware indexing metadata (userIds/groupIds/rbacScope), safe-by-default behavior for requests without a valid user token, and optional elevated-read debugging support.

Changed

  • Bumped chat model to gpt-5-mini.

[v2.3.0] – 2025-12-15

Added

  • Support for SharePoint Lists in the ingestion component.
  • Refactored Single Agent Strategy to simplify citation handling. [#161]
  • Simplified MCP Strategy. [#159]

Changed

  • Improved robustness of Blob Storage indexing in the ingestion pipeline.
  • Enhanced data ingestion logging for better observability and troubleshooting.

Tested

  • Compatibility with Azure direct models for inference in the orchestration layer.

[v2.2.6] – 2025-12-05

Fixed

  • Fixed Issue #409 by updating the main Bicep template to ensure the SEARCH_CONNECTION_ID app setting points to the correct AI Search connection ID. It was previously pointing to the AI Foundry AI Search dependency.

[v2.2.5] – 2025-12-02

Fixed

  • Fixed Issue #406 by updating networking and private endpoint configuration to prevent the cosmos_vnet_blocked error in Cosmos DB private-only setups.

Changed

  • Automated the creation and registration of the Azure AI Search connection, removing the need for the previous manual workaround.

[v2.2.4] – 2025-11-26

Fixed

  • Fixed a bug in data ingestion component where the Blob storage ingestion process was re-indexing unchanged files when AI Search index had more than 1,000 chunks. Fixed in gpt-rag-ingestion v2.0.6.

Changed

  • Small update in scripts/postProvision.sh to make the Container Apps API Key check more robust by always converting the USE_CAPP_API_KEY variable to lowercase, even when it is unset.

[v2.2.3] – 2025-11-15

Fixed

  • Intermittent AI Foundry post provisioning setup authentication timeout by increasing AzureCliCredential and ManagedIdentityCredential process timeout to 30 seconds in config/aifoundry/setup.py
  • Compatibility with older AZD versions by removing string interpolation syntax from capability host connection arrays in AI Foundry project module (infra/modules/ai-foundry/modules/project/main.bicep lines 229-231)

Changed

  • Suppressed BCP081 warnings for future-dated API versions (2025-01-01, 2025-04-01, 2025-05-01, 2025-06-01) in AI Foundry project module by adding #disable-next-line directives
  • Improved PR and Issue templates
  • Moved documentation to https://aka.ms/gpt-rag-docs
  • Bumped gpt-rag-mcp to v0.2.3

[v2.2.2] – 2025-11-09

Changed

  • Updated infra templates to create the data private endpoint for Azure Container Registry when in network isolation mode.
  • Updated Bastion configuration to retrieve credentials from Key Vault. Users can now simply reset the testvmuser password to access the VM for the first time.

[v2.2.1] – 2025-10-21

Added

  • Added more troubleshooting logs.

Fixed

  • Citations 387

[v2.2.0] – 2025-10-16

Added

  • Bring your own VNet. #370.
  • Agentic Retrieval. #359.

Fixed

  • Citation links opens up new chat windows instead of rendering files #387

[v2.1.2] – 2025-10-02

Changed

  • Fixed a bug in data ingestion component where the SharePoint ingestion process was unnecessarily re-indexing unchanged files.

[v2.1.1] – 2025-09-22

Changed

  • Limit azd environment variables to the script process (no longer persisted to the user profile) to reduce secret exposure. Resolves #378.
  • Streamline AI Search provisioning: now creates only the AI Search index. Previously we also created indexers, skillsets, and data sources that are no longer used and caused confusion about expected runtime behavior. Indexing is performed by the gpt-rag-ingestion jobs — see the ingestion docs for how to run, schedule, or troubleshoot ingest jobs. Resolves #377.

[v2.1.0] – 2025-08-31

Added

Changed

  • Standardized resource group variable as AZURE_RESOURCE_GROUP. #365

[v2.0.5] - 2025-08-26

Fixed

  • Resolved VM deployment errors when using CustomScriptExtension under network isolation.

[v2.0.4] - 2025-08-21

Added

  • Updated orchestrator to version 2.0.3, which includes NL2SQL docs and improved settings checks.

[v2.0.3] - 2025-08-19

Fixed

  • Resolved issue with using Azure Container Apps under a private endpoint in AI Search as a custom web skill.

Added

  • Blob Storage Data Source Ingestion.
  • NL2SQL Metadata Ingestion from Blob Storage.

[v2.0.2] - 2025-08-08

Changed

  • Updated deployment documentation.

[v2.0.1]

Changed

  • Updated deployment documentation.

Fixed

  • Resolved deployment issues introduced in v2.0.0.

[v2.0.0] - 2025-07-15

Changed

  • Major architecture refactor to support the vNext architecture.