Demo video: `foodbot_demo.mp4`
> [!NOTE]
> Data for this project is sourced from repositories in the `include` directory. It is scraped, processed, and cleaned using multiple modules. The `dm_tripadvisor` and `fs_tripadvisor` datasets are publicly available in BigQuery for authenticated users and power the restaurant recommendation system.
A next-generation restaurant recommendation system implementing a RAG (Retrieval-Augmented Generation) architecture with vector search and multi-criteria decision analysis (MCDA, via ELECTRE III). Built on LlamaIndex for agent orchestration, Qdrant for vector storage, and BigQuery for data operations.
- Vector Search: Qdrant with FastEmbed for dense retrieval
- MCDA Engine: ELECTRE III implementation for ranking restaurants
- LLM Integration: OpenAI API with streaming responses to generate the final natural-language answers
- Data Layer: BigQuery for structured data + Qdrant collections
- Agent Framework: LlamaIndex with custom tools and callbacks
- FastEmbed for dense embeddings generation
- Qdrant collections for restaurant vectors
- Hybrid search combining semantic and metadata filtering
- ELECTRE III algorithm for restaurant ranking
- Custom concordance/discordance thresholds
- Multi-criteria evaluation:
- Food quality (delicious, fresh, etc.)
- Price sensitivity (affordable, expensive, etc.)
- Ambience (quiet, cozy, etc.)
- Service (friendly, fast, polite, etc.)
- Distance to user location (using distance mapping)
- Query matching (using cosine similarity)
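As a rough sketch of the query-match criterion, cosine similarity between the user's query embedding and each restaurant's embedding can be computed with plain numpy; the function name and toy vectors below are illustrative, not the project's actual code:

```python
import numpy as np

def cosine_similarity(query_vec: np.ndarray, doc_vecs: np.ndarray) -> np.ndarray:
    """Cosine similarity between one query embedding and a batch of
    restaurant embeddings (rows of doc_vecs)."""
    query_norm = query_vec / np.linalg.norm(query_vec)
    doc_norms = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    return doc_norms @ query_norm

query = np.array([1.0, 0.0])
docs = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
scores = cosine_similarity(query, docs)  # → [1.0, 0.0, ~0.707]
```

In the real pipeline these scores come back from Qdrant alongside each candidate, so the ranker can reuse them directly as one criterion column.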
Pre-processing before ranking: Convert review sentiment (positive, negative, etc.) to numerical scores, then apply ELECTRE III to rank restaurants based on user preferences.
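That pre-processing step could look like the following sketch; the sentiment labels, the score scale, and the neutral fallback are assumptions for illustration, not the project's actual mapping:

```python
# Hypothetical sentiment-to-score mapping; the real labels and scale
# used by the project may differ.
SENTIMENT_SCORES = {
    "very negative": 0.0,
    "negative": 0.25,
    "neutral": 0.5,
    "positive": 0.75,
    "very positive": 1.0,
}

def aspect_score(review_sentiments: list[str]) -> float:
    """Average the numeric scores of a restaurant's review sentiments
    for one criterion (e.g. food quality)."""
    if not review_sentiments:
        return 0.5  # no evidence: fall back to neutral
    total = sum(SENTIMENT_SCORES.get(s, 0.5) for s in review_sentiments)
    return total / len(review_sentiments)

aspect_score(["positive", "very positive", "neutral"])  # → 0.75
```

One such score per criterion and per restaurant fills the decision matrix that ELECTRE III then ranks against the user's weight profile.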
> [!IMPORTANT]
> Because ELECTRE III is a decision-analysis algorithm built on many matrix operations, ranking restaurants with plain numpy can take significant time. To improve performance and user experience, we use numba to speed up the ranking process by compiling the numpy-based functions with `@njit` (an alias for `@jit(nopython=True)`). This lets our Python functions run at near-C speed.
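As an illustration, a per-criterion ELECTRE III concordance matrix can be written as an explicit double loop that numba compiles well; the threshold values, toy scores, and the import fallback here are assumptions for this sketch, not the project's actual implementation:

```python
import numpy as np

try:
    from numba import njit  # @njit == @jit(nopython=True): compile to machine code
except ImportError:          # graceful no-op fallback if numba is absent
    def njit(func):
        return func

@njit
def concordance_matrix(scores, q, p):
    """Per-criterion ELECTRE III concordance c(a, b): 1.0 where b does
    not outperform a by more than the indifference threshold q, 0.0
    where the gap exceeds the preference threshold p, and a linear
    interpolation in between. Simplified single-criterion sketch."""
    n = scores.shape[0]
    c = np.empty((n, n))
    for a in range(n):
        for b in range(n):
            diff = scores[b] - scores[a]
            if diff <= q:
                c[a, b] = 1.0
            elif diff >= p:
                c[a, b] = 0.0
            else:
                c[a, b] = (p - diff) / (p - q)
    return c

scores = np.array([0.9, 0.75, 0.2])
C = concordance_matrix(scores, q=0.05, p=0.3)
```

Explicit loops like this are exactly where `@njit` pays off: numba removes the Python interpreter overhead on the inner O(n²) comparisons without changing the numpy-style code.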
```
src/
├── bigquery/   # BigQuery operations and data handlers
├── chat/       # LlamaIndex agent implementation
├── helper/     # Utility functions
├── qdrant/     # Vector DB operations
├── ranker/     # ELECTRE III implementation
└── s3/         # S3 client for asset storage (user storage)
```
- LlamaIndex RAG implementation
- Custom tools for:
  - `candidate_generation_and_ranking`: generate candidate restaurants and rank them using MCDA
  - `enrich_restaurant_recommendations`: enrich recommendations with more information and generate the final natural response
- Streaming response handlers for tool callbacks
- Context management with chat history
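The two custom tools can be sketched as plain Python functions before being wrapped for the agent; the signatures, the static candidate list, and the placeholder enrichment below are illustrative assumptions, not the project's actual code:

```python
from typing import TypedDict

class Restaurant(TypedDict):
    name: str
    score: float

def candidate_generation_and_ranking(query: str, top_k: int = 5) -> list[Restaurant]:
    """Tool 1 (sketch): retrieve candidates from the vector store and
    rank them with the MCDA engine. Retrieval and ranking are stubbed
    here with a static, pre-scored list."""
    candidates: list[Restaurant] = [
        {"name": "Seoul Garden", "score": 0.91},
        {"name": "Gangnam Grill", "score": 0.84},
        {"name": "BBQ House", "score": 0.62},
    ]
    ranked = sorted(candidates, key=lambda r: r["score"], reverse=True)
    return ranked[:top_k]

def enrich_restaurant_recommendations(restaurants: list[Restaurant]) -> list[dict]:
    """Tool 2 (sketch): attach structured metadata (address, website,
    image, ...) that would normally be fetched from BigQuery."""
    return [{**r, "address": "unknown"} for r in restaurants]
```

In the real project these functions would be registered as LlamaIndex tools (e.g. via `FunctionTool.from_defaults`) so the agent can invoke them during a chat turn and stream intermediate results back through the callbacks.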
```mermaid
sequenceDiagram
    actor User
    participant LlamaAgent as LlamaIndex Agent
    participant Qdrant as Qdrant Vector DB
    participant BigQuery as BigQuery Data Layer
    participant ELECTRE as ELECTRE III Ranker

    Note right of User: Submit restaurant query<br/>with weights profile
    User->>LlamaAgent: "Find Korean BBQ with cozy vibe"

    Note right of LlamaAgent: Vector Search & Candidate Generation
    LlamaAgent->>+Qdrant: Hybrid search query<br/>(semantic + metadata filters)
    Qdrant->>Qdrant: Search restaurant vectors<br/>(cosine similarity)
    Qdrant-->>-LlamaAgent: Return 100*K candidate restaurants<br/>(with similarity scores)

    Note right of LlamaAgent: MCDA Ranking with ELECTRE III
    LlamaAgent->>+ELECTRE: Scoring and ranking phase
    ELECTRE->>ELECTRE: Calculate criteria matrix<br/>(Food Quality, Price, Ambience,<br/>Service, Distance, Query Match)
    ELECTRE->>ELECTRE: Apply ELECTRE III algorithm<br/>with weights set
    ELECTRE-->>-LlamaAgent: Return top-K restaurants

    Note right of LlamaAgent: Data Enrichment Phase
    LlamaAgent->>+BigQuery: Query structured metadata on BigQuery<br/>(fs_tripadvisor)
    BigQuery->>BigQuery: Fetch details (address, image, website, ...)
    BigQuery-->>-LlamaAgent: Return enriched data

    Note right of LlamaAgent: Generate Final Response
    LlamaAgent->>User: Deliver personalized<br/>restaurant recommendations
```
- python ~= 3.11
- `uv` package manager (for Python deps)
- `npx` (for the `prisma` CLI)
- OpenAI API key: OpenAI Console (mandatory)
- GCP account with BigQuery: GCP Console (optional; you can use local storage)
- Qdrant Cloud instance: Qdrant Console (optional; you can use the provided Docker containers)
- AWS S3 (or an alternative storage service using the S3 API) (optional; you can use the provided Docker containers)
> [!NOTE]
> For local development, you can use the provided Docker containers for Qdrant and S3. If you want to use remote services, you can set up your own Qdrant and S3 instances or use the provided credentials in the `.env` file.
- For remote services, if you want to use BigQuery, you must put `sa.json` (the service account JSON file) in the root directory and set `FEATURE_STORAGE_MODE` to `remote` in the `src/helper/vars.py` file. If you want to use local storage, set it to `local`.
- For Qdrant and S3, set the `QDRANT_*` and `*_AWS_*` environment variables in the `.env` file to your remote service credentials.
1. **Install dependencies with `uv`**

   ```shell
   uv sync
   ```

2. **Set up environment variables**

   Copy the `.env.example` file to `.env` and fill in the required values:

   ```shell
   cp .env.example .env
   ```

   Change the `OPENAI_API_KEY` to your OpenAI API key.

3. **Start containers** (optional; skip if using remote services)

   If you want to use local Qdrant and S3, start the containers with `docker-compose`:

   ```shell
   docker compose up -d
   ```

   This starts Qdrant and an S3-compatible storage service (MinIO) locally. Make sure you have Docker installed and running.

4. **Initialize Qdrant collections**

   If you are using Qdrant, initialize the collections by running:

   ```shell
   make load_qdrant
   ```

   This creates the necessary collections in Qdrant and loads the initial data.

5. **Initialize the chatbot schema (Prisma)**

   Because Prisma manages storage of the chatbot schema, generate the Prisma client and apply the schema migrations:

   ```shell
   make db
   ```

   If you are using a remote database, change the `DATABASE_URL` in the `.env` file to your connection string.

6. **Start the development server**

   ```shell
   make run
   ```

   Open http://localhost:8000 for the LlamaIndex agent UI.
This project uses uv as the package manager. Dependencies are managed in the uv configuration file (`pyproject.toml`).
- Core: `llama-index`, `fastembed`, `qdrant-client`
- Data: `google-cloud-bigquery`, `pandas`
- UI: `chainlit`
This project is licensed under the MIT License - see the LICENSE file for details.