This project is a FastAPI chatbot built with LangGraph, LangChain, and SQLite-backed persistent memory. It supports provider-based LLM configuration and is set up to use Groq by default. Each request reloads the stored conversation for a session_id, builds a bounded prompt context (sliding window or summary + window), invokes the model, persists the new turn, and returns the updated history.
## Requirements

- Python 3.12
- A Groq API key for the default setup
## Setup

```bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install --upgrade pip
pip install -r requirements.txt
cp .env.example .env
```

Update `.env` and set a valid `GROQ_API_KEY`.
The application validates required configuration during startup. If the API key for the selected provider is missing, startup exits with a clear error message instead of waiting until the first chat request.
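A fail-fast check of this kind can be sketched as below. The names `validate_config` and `REQUIRED_KEYS` are illustrative, not the project's actual code (which presumably lives in `app/config.py`):

```python
import os
import sys

# Illustrative provider -> required env var mapping (an assumption, not the real config)
REQUIRED_KEYS = {"groq": "GROQ_API_KEY"}

def validate_config(provider: str = "groq") -> None:
    """Exit at startup with a clear message if the selected provider's key is missing."""
    key_name = REQUIRED_KEYS.get(provider)
    if key_name and not os.environ.get(key_name):
        sys.exit(f"Missing {key_name}: set it in .env before starting the server")
```

Exiting before the server binds a port surfaces misconfiguration immediately, rather than on the first `/chat` request.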
## Running

```bash
. .venv/bin/activate
uvicorn main:app --reload
```

You can also run:

```bash
. .venv/bin/activate
python main.py
```

Health check:

```bash
curl http://127.0.0.1:8000/health
```

Expected response:
```json
{"status":"ok"}
```

## Testing

```bash
. .venv/bin/activate
pytest
```

## Configuration

- `MEMORY_STRATEGY` controls context assembly (`sliding_window` or `summary_window`).
- `MEMORY_WINDOW_SIZE` controls how many recent message pairs are kept verbatim.
- `MAX_CONTEXT_TOKENS` sets a rough context cap using `chars // 4` token estimation.
- The system prompt is always placed at index `0` in the model input.
- For `summary_window`, rolling summary state is persisted in SQLite (`chat_summaries`) so it survives restarts.
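The sliding-window strategy with the `chars // 4` estimate can be sketched roughly like this. Function and field names are illustrative; the project's actual implementation presumably lives in `app/context_window.py`:

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic from the configuration above: ~4 characters per token.
    return len(text) // 4

def bound_context(messages: list[dict], max_tokens: int, window_pairs: int) -> list[dict]:
    """Sliding-window sketch: keep the system prompt at index 0, then as many of
    the most recent user/assistant pairs as fit under the token cap."""
    system, rest = messages[0], messages[1:]
    recent = rest[-2 * window_pairs:]        # last N pairs kept verbatim
    kept: list[dict] = []
    budget = max_tokens - estimate_tokens(system["content"])
    for msg in reversed(recent):             # walk newest-first, drop oldest when over budget
        cost = estimate_tokens(msg["content"])
        if budget - cost < 0:
            break
        budget -= cost
        kept.append(msg)
    return [system] + list(reversed(kept))   # system prompt always at index 0
```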
## Example usage

```bash
curl -X POST http://127.0.0.1:8000/chat \
  -H "Content-Type: application/json" \
  -d '{
    "session_id": "user-123",
    "message": "Hi, my name is Hasib."
  }'
```

Second turn with the same session:
```bash
curl -X POST http://127.0.0.1:8000/chat \
  -H "Content-Type: application/json" \
  -d '{
    "session_id": "user-123",
    "message": "What is my name?"
  }'
```

Because the same `session_id` is reused, the application loads the earlier conversation from SQLite before calling the model again.
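That load step can be sketched with `sqlite3` as below. The table name `chat_messages` and the helper name `load_history` are assumptions for illustration; the real query lives somewhere in `app/memory.py`:

```python
import sqlite3

def load_history(db_path: str, session_id: str) -> list[tuple[str, str]]:
    """Illustrative sketch: fetch (role, content) rows for one session in insertion order."""
    with sqlite3.connect(db_path) as conn:
        rows = conn.execute(
            "SELECT role, content FROM chat_messages "
            "WHERE session_id = ? ORDER BY id",
            (session_id,),
        ).fetchall()
    return rows
```

Ordering by the autoincrementing row id reproduces insertion order, which matches how the application replays a session's history.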
## Validation

- `session_id` must be a non-empty string and is limited to 255 characters.
- `message` must be non-empty and is limited to 8000 characters.
- Invalid requests return a consistent error shape:
```json
{
  "error": {
    "code": "validation_error",
    "message": "session_id: Value error, must not be blank"
  }
}
```

## Project structure

```text
app/
  __init__.py
  config.py
  context_window.py
  graph.py
  llm.py
  main.py
  memory.py
  schemas.py
  state.py
tests/
  test_api.py
  test_graph.py
  test_memory.py
main.py
README.md
.env.example
requirements.txt
```
## Request flow

1. The client sends `session_id` and `message` to `POST /chat`.
2. FastAPI receives the request in `app/main.py`.
3. The LangGraph `StateGraph` invokes `process_message`.
4. `process_message` loads persistent history and summary state from SQLite.
5. A bounded model context is assembled using the configured memory strategy and token cap.
6. The system prompt is always placed at position `0`, then recent/summary context and the new user message are appended.
7. The configured LLM provider receives this bounded context.
8. The assistant reply is appended and written back to SQLite.
9. The API returns the assistant reply and the updated history.
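The flow above can be condensed into a minimal in-memory sketch, with a dict standing in for SQLite and a plain callable standing in for the LLM. All names here are illustrative, not the project's actual code:

```python
def handle_chat(store: dict, session_id: str, message: str, model) -> dict:
    """Minimal per-request flow: load history, build context, invoke, persist, return."""
    history = store.setdefault(session_id, [])           # load persisted history
    history.append({"role": "human", "content": message})
    # system prompt at index 0, then history including the new user message
    context = [{"role": "system", "content": "You are a helpful assistant."}] + history
    reply = model(context)                               # invoke the configured LLM
    history.append({"role": "ai", "content": reply})     # persist the new turn
    return {"reply": reply, "history": list(history)}    # return updated history
```

In the real application this per-request logic runs inside the LangGraph `process_message` node, with SQLite providing the persistence and the context additionally bounded by the memory strategy.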
## Memory

- Chat history is stored in SQLite at `data/chat_memory.db` by default.
- Each stored message contains `session_id`, `role`, `content`, and a timestamp.
- Messages are loaded in insertion order for each session.
- System, human, and AI messages are persisted so later turns can reuse context.
- Summary mode persists rolling summaries in `chat_summaries` with `summarized_upto_message_id`.
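Given the fields above, the schema might look roughly like this. The table name `chat_messages`, the `id` and `created_at` columns, and the exact column types are assumptions; only the fields named in this section come from the project:

```python
import sqlite3

# Hypothetical schema matching the fields described above
SCHEMA = """
CREATE TABLE IF NOT EXISTS chat_messages (
    id         INTEGER PRIMARY KEY AUTOINCREMENT,
    session_id TEXT NOT NULL,
    role       TEXT NOT NULL,                          -- system / human / ai
    content    TEXT NOT NULL,
    created_at TEXT NOT NULL DEFAULT (datetime('now'))
);
CREATE TABLE IF NOT EXISTS chat_summaries (
    session_id                 TEXT PRIMARY KEY,
    summary                    TEXT NOT NULL,
    summarized_upto_message_id INTEGER NOT NULL
);
"""

def init_db(path: str) -> None:
    """Create both tables if they do not exist yet."""
    with sqlite3.connect(path) as conn:
        conn.executescript(SCHEMA)
```

Keying `chat_summaries` by `session_id` with a `summarized_upto_message_id` watermark lets summary mode extend the rolling summary incrementally, covering only messages inserted since the last summarization.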
