CyberGuard - Intelligent Analysis, Search and Summarization System

CyberGuard is a complete system that allows uploading documents (PDF, texts, articles, emails) and provides advanced information processing capabilities:

Semantic search based on embedding vectors
Generation of coherent summaries
Context-based questions and answers
Similar content recommendations
Data visualization and analysis

Architecture

The system is built on a modern architecture:

Frontend: HTML/JavaScript with Tailwind CSS
Backend API: FastAPI (Python)
Vector Database: FAISS for fast search
Embeddings: SentenceTransformers (all-MiniLM-L6-v2)
LLM: Integration with OpenAI GPT-3.5 for summarization and Q&A

Features

Upload Center: Upload PDF, TXT files
Text Extractor: Extract text from documents
Chunker: Split text into semantic fragments
Embedding Generator: Transform text into vectors
Vector Index: Store vectors for fast search
Semantic Search: Similarity-based search
Summarizer: Generate coherent summaries
Q&A Chatbot: Context-based answers to questions
Dashboard: View statistics and search history

Installation and Running

Backend

Create a Python virtual environment:

python -m venv venv
source venv/bin/activate  # Linux/Mac
venv\Scripts\activate     # Windows

Install dependencies:

pip install fastapi uvicorn pydantic faiss-cpu openai sentence-transformers PyPDF2

Run the server:

cd astramind
python main.py

The server will run at: http://localhost:8000

Frontend

Open the index.html file from the astramind-client directory in a web browser.

Usage

Upload documents through the drag-and-drop interface
Use the search bar to query the documents
Choose between semantic search, summary generation, or questions and answers
View usage statistics in the dashboard panel

Technologies Used

FastAPI: Python framework for fast APIs
FAISS: Library for efficient vector search
SentenceTransformers: Models for generating embeddings
OpenAI API: For generating summaries and answers
Tailwind CSS: CSS framework for modern design
Chart.js: Library for data visualization

Future Development

Add JWT authentication
Support for more document types (DOCX, HTML, etc.)
Implementation of local models (Llama2) for independence from external APIs
Improvement of the interface for mobile devices
Addition of export and sharing functionalities

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
README.md		README.md
cron_script.py		cron_script.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CyberGuard - Intelligent Analysis, Search and Summarization System

Architecture

Features

Installation and Running

Backend

Frontend

Usage

Technologies Used

Future Development

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CyberGuard - Intelligent Analysis, Search and Summarization System

Architecture

Features

Installation and Running

Backend

Frontend

Usage

Technologies Used

Future Development

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages