Pipeline Dashboard

Medicines 0
PDFs Downloaded 0
Extracted ✅ 0
Extracted ⚠️ 0
Chunks Embedded 0

📥 Step 1: Download PDFs

○ Idle

Fetches PDFs from EU Medicine Register and extracts raw text.

🔬 Step 2: Extract Sections

○ Idle

Parses patient leaflet from PDF, splits into 6 standard sections. Validates all sections found.

🧠 Step 3: Embed

○ Idle

Chunks extracted sections and generates OpenAI text-embedding-3-large vectors. Only runs on validated extractions.