The Lundy Navigator project is an AI-powered knowledge assistant designed for real estate Multiple Listing Service (MLS) organizations to streamline the retrieval of information from vast internal documentation.
Core Functionality:
The application leverages Retrieval-Augmented Generation (RAG) to provide accurate, cited answers from trusted internal sources. Its primary functions include:
- Data Ingestion: Supports manual uploads of PDF, HTML, and Text files, as well as an automated, scheduled web crawler to pull content from internal websites.
- Document Processing: Chunks documents into meaningful paragraphs, prepends contextual location data, and converts them into vector embeddings for storage.4
- Intelligent Retrieval: Performs vector searches to provide direct answers to user queries, complete with links to the exact document sections used as sources.
- Multimodal Interaction: Includes support for voice-to-text queries and audible voice output for hands-free use.
- User Management: Features a multi-tiered dashboard (Admin, Editor, Reader) with role-based access control.
- Asynchronous Scaling: A scalable system (v3) uses asynchronous job queues to handle large files and high-volume processing simultaneously.
Technical Stack
The project utilizes a robust modern technology stack:
- Backend: Python Django and Django REST Framework for the core API and application logic.
- Frontend: Next.js and React for the customer-facing dashboard and user interfaces.
- AI & LLM:
- Models: GPT-4, GPT-4o, and GPT-4o-mini.
- Embeddings: OpenAI text-embedding-3 (large and small).
- OCR: PaddleOCR for multilingual text extraction.
- Data Storage:
- Relational: PostgreSQL.
- NoSQL: MongoDB.
- Vector Database: pgvector within Postgres.
- Infrastructure (AWS):
- Compute: Lambda and EC2
- Storage: S3 for document and artifact storage.
- Messaging: SQS, Redis, and Celery for task queuing and event handling.
- Monitoring: CloudWatch for logging.
- Security: Keycloak (OIDC) for authentication and Role-Based Access Control (RBAC), PropelAuth for session management, and encryption at rest




