example(dspy): extract patient intake forms by DSPy #1365

georgeh0 · 2025-12-04T19:13:32Z

No description provided.

badmonster0 · 2025-12-04T19:15:14Z

badmonster0 · 2025-12-04T19:15:56Z

Thanks a lot, @georgeh0, for the example. Excited for both the cocoindex and dspy communities!

prrao87 · 2025-12-04T19:20:33Z

examples/patient_intake_extraction_dspy/main.py

+
+    form_images = []
+    for page in pdf_doc:
+        # Render page to pixmap (image) at 2x resolution for better quality


This part is super cool! I've found that parsing higher-quality images results in fewer typos in the output, at the cost of more tokens. Still, rendering the page at a higher resolution for the model is a super useful approach!
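For readers following along, here is a minimal sketch of the rendering step being discussed, not the code from this PR. It assumes PyMuPDF is installed and imported as `fitz`; the helper names `scaled_size` and `render_pages` are hypothetical and chosen only for illustration. At zoom 1.0 PyMuPDF renders one pixel per PDF point (72 dpi), so a zoom of 2.0 roughly doubles each dimension and quadruples the pixel count, which is where the extra image tokens come from.

```python
def scaled_size(width_pt: float, height_pt: float, zoom: float = 2.0) -> tuple[int, int]:
    """Approximate pixel size of a page rendered at `zoom` (1.0 = 72 dpi)."""
    return round(width_pt * zoom), round(height_pt * zoom)


def render_pages(pdf_bytes: bytes, zoom: float = 2.0) -> list[bytes]:
    """Render every page of a PDF to PNG bytes at `zoom` times 72 dpi.

    Hypothetical helper; the example in this PR may structure this differently.
    """
    import fitz  # PyMuPDF; imported lazily so scaled_size works without it

    images = []
    with fitz.open(stream=pdf_bytes, filetype="pdf") as doc:
        matrix = fitz.Matrix(zoom, zoom)  # scale x and y by the same factor
        for page in doc:
            pix = page.get_pixmap(matrix=matrix)
            images.append(pix.tobytes("png"))
    return images


# A US Letter page (612 x 792 points) at 2x is about 1224 x 1584 pixels.
print(scaled_size(612, 792))
```

Lowering `zoom` trades legibility for fewer tokens, so it is worth checking the smallest value that still yields clean extraction on your own forms.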

prrao87 · 2025-12-04T19:21:19Z

examples/patient_intake_extraction_dspy/pyproject.toml

+    "cocoindex>=0.3.9",
+    "dspy-ai>=3.0.4",
+    "pydantic>=2.0.0",
+    "pymupdf>=1.24.0",


@georgeh0 why is pymupdf needed here if the LLM is extracting directly from the image bytes?

Is this what fitz is? Oops, now I get it.

prrao87 · 2025-12-04T19:23:30Z

This example is super cool! Love the usage of PyMuPDF to convert the PDF to image bytes and upsize it for the LLM. Really good methodology 👌🏽

example(dspy): extract patient intake forms by DSPy

9142828

badmonster0 merged commit 902e5c9 into main Dec 4, 2025
6 checks passed

georgeh0 deleted the g/dspy branch December 4, 2025 19:16

prrao87 reviewed Dec 4, 2025


4 participants
