You are browsing as a guest. Sign up (or log in) to start making projects!

RAG App

  • 3 Devlogs
  • 2 Total hours

An app that can read a document and answer users' questions using an RAG pipeline. By using a Google GenAI API, the application can understand the text and provide an answer to the user. Moreover, the user can input content through the camera, files, and text.

Open comments for this post

40m 8s logged

Added an OCR feature to allow handwritten text to be used. I also updated the user-query feature by allowing the user to select their own settings for the RAG pipeline. Moreover, I was able to allow the user to upload files and improved the UI to make it look better.

Added an OCR feature to allow handwritten text to be used. I also updated the user-query feature by allowing the user to select their own settings for the RAG pipeline. Moreover, I was able to allow the user to upload files and improved the UI to make it look better.

Replying to @revankotapati

0
6
Ship #1

For this project, I engineered a containerized, cloud-native Retrieval-Augmented Generation (RAG) assistant deployed on Hugging Face Spaces that enables users to securely upload documentation and query it in real time through a Streamlit chat interface. Writing native code directly with the modern google-genai and Pinecone Python SDKs allowed me to bypass bulky orchestrator frameworks, resulting in a lightweight, zero-local-storage pipeline that uses text-embedding-004 and gemini-1.5-flash. Navigating this build came with distinct engineering challenges, particularly resolving rigid SDK endpoint routing mismatches and optimizing a vector dimensionality gap by implementing types.EmbedContentConfig to dynamically truncate embedding outputs down to 768 dimensions to match the Pinecone serverless index. I am incredibly proud of how fast, clean, and cost-effective this standalone architecture turned out, as well as the strict system prompt guardrails that prevent hallucinations by defaulting to a clean fallback response whenever a query falls outside the uploaded context. To thoroughly test the live application, users simply need to verify their API keys, paste a technical text block into the ingestion sidebar to index the cloud vectors, and then run a mix of specific domain questions and out-of-domain prompts to observe the system's precise retrieval performance and defensive constraints.

  • 3 devlogs
  • 2h
Try project → See source code →
Open comments for this post

44m 1s logged

Finished backend and frontend connection.

Finished backend and frontend connection.

Replying to @revankotapati

0
10
Open comments for this post

1h 0m 28s logged

I built the frontend and backend of this RAG pipeline using Python, GoogleGenAI, and Pinecone

I built the frontend and backend of this RAG pipeline using Python, GoogleGenAI, and Pinecone

Replying to @revankotapati

0
5

Followers

Loading…