Pipeline Dashboard
Medicines 0
PDFs Downloaded 0
Extracted ✅ 0
Extracted ⚠️ 0
Chunks Embedded 0
📥 Step 1: Download PDFs
○ IdleFetches PDFs from EU Medicine Register and extracts raw text.
🔬 Step 2: Extract Sections
○ IdleParses patient leaflet from PDF, splits into 6 standard sections. Validates all sections found.
🧠 Step 3: Embed
○ IdleChunks extracted sections and generates OpenAI text-embedding-3-large vectors. Only runs on validated extractions.