
Citegeist

Citegeist answers your AI-related questions with information from research papers, using a retrieval-augmented generation (RAG) LLM system.

Requirements

Nvidia GPU
Docker
Docker Compose

Installation

Install Docker and Docker Compose

https://www.docker.com/get-started/
https://docs.docker.com/compose/install/

Clone project

Run:
git clone https://github.com/segallagher/Citegeist.git

Create vectorstore from papers

  1. Create a .env file with the following values:
DATASET_PATH="data/arxiv_dataset/dataset.csv"
VECTORSTORE_DIR="vectorstore"
PAPER_DIR="published/papers"
  2. Assemble the papers into one CSV file:
    python .\dataset_creation\assemble_dataset.py
  3. Create the vectorstore from the dataset:
    python .\create_vectorstore_db.py
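The vectorstore step embeds each paper and stores the vectors so that questions can later be matched against them by similarity. A minimal sketch of that retrieval idea, using a toy bag-of-words "embedding" and cosine similarity (the project itself uses a learned embedding model served by Ollama, and the miniature dataset below is purely illustrative):

```python
import csv
import io
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy "embedding": bag-of-words term counts. The real pipeline
    # uses a neural embedding model instead.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical miniature "dataset.csv": one paper per row.
dataset_csv = io.StringIO(
    "title,abstract\n"
    "Attention,transformers use attention for sequence modeling\n"
    "ResNet,residual connections ease training of deep networks\n"
)

# "Vectorstore": each paper's title paired with its embedding.
store = [(row["title"], embed(row["abstract"]))
         for row in csv.DictReader(dataset_csv)]

def retrieve(question: str, k: int = 1) -> list:
    # Rank stored papers by similarity to the question embedding.
    q = embed(question)
    return sorted(store, key=lambda p: cosine(q, p[1]), reverse=True)[:k]

print(retrieve("what are residual connections")[0][0])  # ResNet
```

At query time, the RAG system passes the retrieved paper text to the LLM as context for answering the question.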

Use

Start Project

In a terminal, change into the Citegeist directory:
cd Citegeist
Then start the services with Docker Compose:
docker-compose up

Access the web UI

In your browser, go to the web UI at
http://localhost:80

Ask Citegeist an AI research question

Enter your question in the chatbox, and Citegeist will answer it using information retrieved from the research papers.

Benchmarking

Env vars

Add the following to your .env file:

EMBED_MODEL_TYPE="ollama"
EMBED_MODEL="mxbai-embed-large:latest"
LLM_MODEL_TYPE="ollama"
LLM_MODEL="llama3.2:3b"
OLLAMA_HOST="http://localhost:11434"
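A sketch of how a script might consume these settings. The variable names come from this README; the project's own loader (possibly python-dotenv or similar) may differ, and the `os.environ.update` call below only simulates what loading the .env file would do:

```python
import os

# Simulate what loading the .env file above would put in the environment.
os.environ.update({
    "EMBED_MODEL_TYPE": "ollama",
    "EMBED_MODEL": "mxbai-embed-large:latest",
    "LLM_MODEL_TYPE": "ollama",
    "LLM_MODEL": "llama3.2:3b",
    "OLLAMA_HOST": "http://localhost:11434",
})

def load_llm_config() -> dict:
    # Read the Ollama-related settings back out of the environment.
    return {key.lower(): os.environ[key] for key in (
        "EMBED_MODEL_TYPE", "EMBED_MODEL",
        "LLM_MODEL_TYPE", "LLM_MODEL", "OLLAMA_HOST",
    )}

print(load_llm_config()["llm_model"])  # llama3.2:3b
```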

Evaluate

  1. Evaluation happens in evaluate_rag.py. Set the OPERATION parameter to the type of evaluation you want to run.
  2. The questions are stored in data_analysis/questions.json.
    Each question is answered by the RAG LLM system and then judged by llama3.1:8b.
  3. Run evaluate_rag.py.
    This will likely take a while when running Ollama on your own machine.
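The evaluation loop described above can be sketched as follows. The question-file shape, the `rag_answer` helper, and the `judge` function are all placeholders; see evaluate_rag.py and data_analysis/questions.json for the project's actual format and judge prompt:

```python
import io
import json

# Hypothetical miniature questions.json (the real file lives at
# data_analysis/questions.json and may use a different schema).
questions_json = io.StringIO(json.dumps([
    {"question": "What is attention?"},
    {"question": "What is a residual connection?"},
]))

def rag_answer(question: str) -> str:
    # Placeholder for the RAG pipeline (retrieve context, then generate).
    return f"Answer to: {question}"

def judge(question: str, answer: str) -> bool:
    # Placeholder for the llama3.1:8b judge call; here we only check
    # that the answer is non-empty.
    return bool(answer.strip())

questions = json.load(questions_json)
results = [judge(q["question"], rag_answer(q["question"]))
           for q in questions]
print(f"{sum(results)}/{len(results)} judged acceptable")  # 2/2 judged acceptable
```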
