Skip to content

Commit c3a3206

Browse files
authored
Manuals docs (#156)
1 parent 3a4b66f commit c3a3206

File tree

1 file changed

+16
-2
lines changed

1 file changed

+16
-2
lines changed

examples/manuals_llm_extraction/README.md

Lines changed: 16 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,16 @@
1+
# Structured Data Extraction from PDF with Ollama and CocoIndex
2+
3+
![Structured data extraction with Ollama and CocoIndex](https://cocoindex.io/blogs/assets/images/cocoindex-ollama-structured-extraction-from-pdf-6ee15b1e0fe304063dc78f04153fb385.png)
4+
5+
16
In this example, we
27

38
* Converts PDFs (generated from a few Python docs) into Markdown.
49
* Extract structured information from the Markdown using LLM.
510
* Use a custom function to further extract information from the structured output.
611

12+
Please give [Cocoindex on Github](https://github.com/cocoindex-io/cocoindex) a star to support us if you like our work. Thank you so much with a warm coconut hug 🥥🤗. [![GitHub](https://img.shields.io/github/stars/cocoindex-io/cocoindex?color=5B5BD6)](https://github.com/cocoindex-io/cocoindex)
13+
714
## Prerequisite
815

916
Before running the example, you need to:
@@ -47,14 +54,21 @@ And run the SQL query:
4754
```sql
4855
SELECT filename, module_info->'title' AS title, module_summary FROM modules_info;
4956
```
57+
You should see results like:
58+
59+
![Module Info Index](https://cocoindex.io/blogs/assets/images/module_info_index-ffaec6042ec3a18eaf94bed5b227a085.png)
60+
5061

5162
## CocoInsight
52-
CocoInsight is in Early Access now (Free) 😊 You found us! A quick 3 minute video tutorial about CocoInsight: [Watch on YouTube](https://youtu.be/ZnmyoHslBSc?si=pPLXWALztkA710r9).
63+
CocoInsight is a tool to help you understand your data pipeline and data index. CocoInsight is in Early Access now (Free) 😊 You found us! A quick 3 minute video tutorial about CocoInsight: [Watch on YouTube](https://youtu.be/ZnmyoHslBSc?si=pPLXWALztkA710r9).
5364

5465
Run CocoInsight to understand your RAG data pipeline:
5566

5667
```
5768
python main.py cocoindex server -c https://cocoindex.io
5869
```
5970

60-
Then open the CocoInsight UI at [https://cocoindex.io/cocoinsight](https://cocoindex.io/cocoinsight).
71+
Then open the CocoInsight UI at [https://cocoindex.io/cocoinsight](https://cocoindex.io/cocoinsight). It connects to your local CocoIndex server with zero data retention.
72+
73+
You can view the pipeline flow and the data preview in the CocoInsight UI:
74+
![CocoInsight UI](https://cocoindex.io/blogs/assets/images/cocoinsight-edd71690dcc35b6c5cf1cb31b51b6f6f.png)

0 commit comments

Comments
 (0)