engineering·Independent✓ Verified

Build a PDF Q&A System with LlamaIndex, OpenAI Embeddings & Pinecone Vector DB

Parse, Normalize, Extract, and Store PDF Content for RAG in Pinecone

About

Parse, Normalize, Extract, and Store PDF Content for RAG in Pinecone This workflow automates a full RAG pipeline for structured documents (like insurance policies). What it does Watches a Google Drive folder for new PDFs Uploads to LlamaIndex Cloud for parsing → returns clean Markdown Normalizes text (removes headers, footers, page numbers, formatting artifacts) Splits text into chunks (~1200 chars with 150 overlap) Generates embeddings with OpenAI Stores vectors in Pinecone with m

Command-Line Agentic Refactoring of Java Code

Free

engineering

Opencode Plan Manager

A simple collection of tools for better plan management by AI agents on OpenCode.

Free

engineering

Tabnine

Privacy-first AI code completion for enterprise teams

$12/mo

engineering

Kitwork

Automate kit workflows effortlessly with a lightweight, high-performance, fast, and flexible engine for cloud or self-hosted environments.

Free

Build a PDF Q&A System with LlamaIndex, OpenAI Embeddings & Pinecone Vector DB

About

Tags

More in engineering