# OSINT RAG App Quickstart

## Stack

- **Frontend**: Gradio 4.0+ (ChatInterface with auto-generated API endpoints)
- **Database**: Supabase with pgvector (768-dim embeddings, HNSW index)
- **LLM**: HuggingFace Inference API (Llama-3.1-8B-Instruct)
- **Embeddings**: HuggingFace Inference API (all-mpnet-base-v2, 768-dim)
- **Client**: Supabase Python client + InferenceClient (huggingface_hub)

## Key Parameters

- **Temperature**: 0.2 (low, to limit hallucination)
- **Max Tokens**: 600 (keeps responses short)
- **Retrieval K**: 5 tools
- **Match Threshold**: 0.5 (cosine similarity)
- **Connection**: Transaction Pooler (port 6543)
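
To illustrate how **Retrieval K** and **Match Threshold** interact, here is a minimal, database-free sketch in plain Python. It is not the app's actual retrieval code: in the real stack the similarity search would run inside Supabase against the HNSW index. The names `RagConfig`, `cosine_similarity`, and `retrieve` are hypothetical, and the strict `>` comparison against the threshold is an assumption.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class RagConfig:
    """Hypothetical container for the parameters listed above."""
    temperature: float = 0.2      # low, to limit hallucination
    max_tokens: int = 600         # keeps responses short
    retrieval_k: int = 5          # number of tools returned per query
    match_threshold: float = 0.5  # minimum cosine similarity
    embedding_dim: int = 768      # all-mpnet-base-v2 output size
    pooler_port: int = 6543       # Transaction Pooler port


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query_vec, rows, config=RagConfig()):
    """Return up to retrieval_k (name, score) pairs scoring above the threshold.

    `rows` is an iterable of (name, embedding) pairs. Assumption: a row must
    score strictly above match_threshold to be kept.
    """
    scored = [(name, cosine_similarity(query_vec, vec)) for name, vec in rows]
    kept = [(name, score) for name, score in scored if score > config.match_threshold]
    kept.sort(key=lambda item: item[1], reverse=True)  # best match first
    return kept[: config.retrieval_k]
```

With these defaults, a query returns at most five tools, and it may return fewer (or none) when few rows clear the 0.5 similarity threshold.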