OpenSPG
diff --git a/.idea/workspace.xml b/.idea/workspace.xml
Lines changed: 17 additions & 33 deletions
diff --git a/website/docs/ch/_category_.json b/website/docs/ch/_category_.json
Lines changed: 7 additions & 0 deletions
diff --git a/website/docs/ch/index.md b/website/docs/ch/index.md
Lines changed: 81 additions & 0 deletions
diff --git a/website/docs/en/DesignPhilosophy/TechReport/TechnicalReport.md b/website/docs/en/DesignPhilosophy/TechReport/TechnicalReport.md
Lines changed: 8 additions & 0 deletions
diff --git a/website/docs/en/DesignPhilosophy/TechReport/WhitePaperforOpenSPG.md b/website/docs/en/DesignPhilosophy/TechReport/WhitePaperforOpenSPG.md
Lines changed: 8 additions & 0 deletions
diff --git a/website/docs/en/DesignPhilosophy/TechReport/_category_.json b/website/docs/en/DesignPhilosophy/TechReport/_category_.json
Lines changed: 7 additions & 0 deletions
diff --git a/website/docs/en/DesignPhilosophy/TechnologySharing/2024.11.09ShangHaiMeetup.md b/website/docs/en/DesignPhilosophy/TechnologySharing/2024.11.09ShangHaiMeetup.md
Lines changed: 8 additions & 0 deletions
diff --git a/website/docs/en/DesignPhilosophy/TechnologySharing/KAGIntroductionandapplications.md b/website/docs/en/DesignPhilosophy/TechnologySharing/KAGIntroductionandapplications.md
Lines changed: 8 additions & 0 deletions
diff --git a/website/docs/en/DesignPhilosophy/TechnologySharing/_category_.json b/website/docs/en/DesignPhilosophy/TechnologySharing/_category_.json
Lines changed: 7 additions & 0 deletions
diff --git a/website/docs/en/DesignPhilosophy/_category_.json b/website/docs/en/DesignPhilosophy/_category_.json
Lines changed: 7 additions & 0 deletions
diff --git a/website/docs/en/Examples/Privatedomaindatasets/EnterpriseSupplyChainKnowledgeGraph.md b/website/docs/en/Examples/Privatedomaindatasets/EnterpriseSupplyChainKnowledgeGraph.md
Lines changed: 10 additions & 0 deletions
diff --git a/website/docs/en/Examples/Privatedomaindatasets/MedicalKnowledgeGraph.md b/website/docs/en/Examples/Privatedomaindatasets/MedicalKnowledgeGraph.md
Lines changed: 11 additions & 0 deletions
diff --git a/website/docs/en/Examples/Privatedomaindatasets/RiskMiningKnowledgeGraph.md b/website/docs/en/Examples/Privatedomaindatasets/RiskMiningKnowledgeGraph.md
Lines changed: 10 additions & 0 deletions
diff --git a/website/docs/en/Examples/Privatedomaindatasets/_category_.json b/website/docs/en/Examples/Privatedomaindatasets/_category_.json
Lines changed: 7 additions & 0 deletions
diff --git a/website/docs/en/Examples/Publicdomaindatasets/Hotpotqa(Multi-hopQ&A).md b/website/docs/en/Examples/Publicdomaindatasets/Hotpotqa(Multi-hopQ&A).md
Lines changed: 10 additions & 0 deletions
diff --git a/website/docs/en/Examples/Publicdomaindatasets/Musique(Multi-hopQ&A).md b/website/docs/en/Examples/Publicdomaindatasets/Musique(Multi-hopQ&A).md
Lines changed: 10 additions & 0 deletions
diff --git a/website/docs/en/Examples/Publicdomaindatasets/Twowiki(Multi-hopQ&A).md b/website/docs/en/Examples/Publicdomaindatasets/Twowiki(Multi-hopQ&A).md
Lines changed: 10 additions & 0 deletions
diff --git a/website/docs/en/Examples/Publicdomaindatasets/_category_.json b/website/docs/en/Examples/Publicdomaindatasets/_category_.json
Lines changed: 7 additions & 0 deletions
diff --git a/website/docs/en/Examples/Publicdomaindatasets/benchmark.md b/website/docs/en/Examples/Publicdomaindatasets/benchmark.md
Lines changed: 57 additions & 0 deletions
diff --git a/website/docs/tutorial-extras/_category_.json b/website/docs/en/Examples/_category_.json
rename from website/docs/tutorial-extras/_category_.json
rename to website/docs/en/Examples/_category_.json
Lines changed: 1 addition & 1 deletion
@@ -0,0 +1,7 @@
+{
+  "label": "Chinese Documentation",
+  "position": 2,
+  "link": {
+    "type": "generated-index"
+  }
+}
@@ -0,0 +1,81 @@
+# User Manual
+
+
+## [Quick Start](快速开始.md)
+
+## User Guide
+
+- [User Permission Management](用户向导/用户权限管理.md)
+- [Knowledge Base Configuration](用户向导/知识库配置.md)
+
+### Model Service Configuration
+
+- [Representation (embedding) Model](用户向导/模型服务配置/表示(embedding)模型.md)
+- [Generation (chat) Model](用户向导/模型服务配置/生成(chat)模型.md)
+
+### Knowledge Modeling
+
+- [Declarative Schema](用户向导/知识建模/声明式schema.md)
+- [Modeling Best Practices](用户向导/知识建模/建模最佳实践.md)
+- [Semantic Classification of Concept Relations](用户向导/知识建模/概念关系语义分类.md)
+- [Visual View](用户向导/知识建模/可视化查看.md)
+
+### Knowledge Reasoning Syntax
+
+- [Expert Rule (DSL) Syntax](用户向导/知识推理语法/专家规则（DSL）语法.md)
+- [Physical Engine Adaptation (RDG)](用户向导/知识推理语法/物理引擎适配RDG.md)
+
+### Custom Extensions
+
+- [Custom Schema](用户向导/自定义扩展/自定义schema.md)
+- [Custom Prompt](用户向导/自定义扩展/自定义prompt.md)
+- [Custom Code](用户向导/自定义扩展/自定义代码.md)
+- [Domain Knowledge Mounting](用户向导/自定义扩展/领域知识挂载.md)
+- [Knowledge Exploration](用户向导/知识探查.md)
+- [Command-Line Tool](用户向导/命令行工具.md)
+- [Building & Deploying from Source](用户向导/源码编译&部署.md)
+- [How to Upgrade from v0.5 to v0.6](用户向导/v0.5如何升级到v0.6.md)
+
+## Built-in Examples
+
+
+### Private Domain Datasets
+
+- [Enterprise Supply Chain](内置案例/垂域数据集/企业供应链.md)
+- [Risk Mining](内置案例/垂域数据集/黑产挖掘.md)
+- [Medical Knowledge Graph](内置案例/垂域数据集/医疗图谱.md)
+
+### Public Domain Datasets
+
+- [benchmark](内置案例/公开数据集/benchmark.md)
+- [hotpotqa](内置案例/公开数据集/hotpotqa.md)
+- [musique](内置案例/公开数据集/musique.md)
+- [twowiki](内置案例/公开数据集/twowiki.md)
+
+## System Integration
+
+- [HTTP API Reference](系统集成/HTTPAPIReference.md)
+
+## Design Philosophy & Code Architecture
+
+
+### Design Philosophy
+
+- [KAG Technical Report](设计理念&代码架构/设计理念/KAG技术报告.md)
+- [SPG White Paper](设计理念&代码架构/设计理念/SPG白皮书.md)
+
+### Technology Sharing
+
+- [KAG Introduction and Applications](设计理念&代码架构/技术分享/KAGintroductionandapplications.md)
+- [2024.11.09 Shanghai Meetup](设计理念&代码架构/技术分享/2024.11.09ShangHaiMeetup.md)
+
+## Version Upgrade Notes
+
+- [Differences Between the Developer and Product Sides](版本升级说明/开发者和产品侧差异.md)
+- [Release Notes](版本升级说明/ReleaseNotes.md)
+
+## Contributing
+
+- [KAG Near-Term Iteration Priorities](参与贡献/KAG近期迭代重点.md)
+
+## [FAQ](常见问题.md)
@@ -0,0 +1,8 @@
+---
+sidebar_position: 2
+---
+
+# Technical Report
+
+[technical report of KAG · OpenSPG · Discussion #51](https://github.com/orgs/OpenSPG/discussions/51)
+
@@ -0,0 +1,8 @@
+---
+sidebar_position: 1
+---
+
+# White Paper for OpenSPG
+
+[white paper for openspg · OpenSPG · Discussion #50](https://github.com/orgs/OpenSPG/discussions/50)
+
@@ -0,0 +1,7 @@
+{
+  "label": "Tech Report",
+  "position": 2,
+  "link": {
+    "type": "generated-index"
+  }
+}
@@ -0,0 +1,8 @@
+---
+sidebar_position: 1
+---
+
+# 2024.11.09 Shanghai Meetup
+
+[1109 Meetup PPT · OpenSPG · Discussion #45](https://github.com/orgs/OpenSPG/discussions/45)
+
@@ -0,0 +1,8 @@
+---
+sidebar_position: 2
+---
+
+# KAG Introduction and Applications
+
+[KAG introduction and applications · OpenSPG · Discussion #52](https://github.com/orgs/OpenSPG/discussions/52)
+
@@ -0,0 +1,7 @@
+{
+  "label": "Technology Sharing",
+  "position": 1,
+  "link": {
+    "type": "generated-index"
+  }
+}
@@ -0,0 +1,7 @@
+{
+  "label": "Design Philosophy",
+  "position": 6,
+  "link": {
+    "type": "generated-index"
+  }
+}
@@ -0,0 +1,10 @@
+---
+sidebar_position: 1
+---
+
+# Enterprise Supply Chain Knowledge Graph
+
+The documentation for the examples has been migrated to GitHub; please refer to:
+
+[KAG/kag/examples/supplychain/README.md at master · OpenSPG/KAG](https://github.com/OpenSPG/KAG/blob/master/kag/examples/supplychain/README.md)
+
@@ -0,0 +1,11 @@
+---
+sidebar_position: 3
+---
+
+# Medical Knowledge Graph
+
+The documentation for the examples has been migrated to GitHub; please refer to:
+
+[KAG/kag/examples/medicine/README.md at master · OpenSPG/KAG](https://github.com/OpenSPG/KAG/blob/master/kag/examples/medicine/README.md)
+
+
@@ -0,0 +1,10 @@
+---
+sidebar_position: 2
+---
+
+# Risk Mining Knowledge Graph
+
+The documentation for the examples has been migrated to GitHub; please refer to:
+
+[KAG/kag/examples/riskmining/README.md at master · OpenSPG/KAG](https://github.com/OpenSPG/KAG/blob/master/kag/examples/riskmining/README.md)
+
@@ -0,0 +1,7 @@
+{
+  "label": "Private domain datasets",
+  "position": 1,
+  "link": {
+    "type": "generated-index"
+  }
+}
@@ -0,0 +1,10 @@
+---
+sidebar_position: 2
+---
+
+# HotpotQA (Multi-hop Q&A)
+
+The documentation for the examples has been migrated to GitHub; please refer to:
+
+[KAG/kag/examples/hotpotqa/README.md at master · OpenSPG/KAG](https://github.com/OpenSPG/KAG/blob/master/kag/examples/hotpotqa/README.md)
+
@@ -0,0 +1,10 @@
+---
+sidebar_position: 4
+---
+
+# MuSiQue (Multi-hop Q&A)
+
+The documentation for the examples has been migrated to GitHub; please refer to:
+
+[KAG/kag/examples/musique/README.md at master · OpenSPG/KAG](https://github.com/OpenSPG/KAG/blob/master/kag/examples/musique/README.md)
+
@@ -0,0 +1,10 @@
+---
+sidebar_position: 3
+---
+
+# 2Wiki (Multi-hop Q&A)
+
+The documentation for the examples has been migrated to GitHub; please refer to:
+
+[KAG/kag/examples/2wiki/README.md at master · OpenSPG/KAG](https://github.com/OpenSPG/KAG/blob/master/kag/examples/2wiki/README.md)
+
@@ -0,0 +1,7 @@
+{
+  "label": "Public domain datasets",
+  "position": 2,
+  "link": {
+    "type": "generated-index"
+  }
+}
@@ -0,0 +1,57 @@
+---
+sidebar_position: 1
+---
+
+# Benchmark
+
+# Performance on multi-hop factual QA tasks
+The EM and F1 scores of KAG V0.6, obtained with the same experimental configuration as the KAG technical report ([https://arxiv.org/pdf/2409.13731](https://arxiv.org/pdf/2409.13731)), are as follows:
+
+| | **HotpotQA** | | **2Wiki** | | **MuSiQue** | | **Average** | |
+| --- | --- | --- | --- | --- | --- | --- | --- | --- |
+| | EM | F1 | EM | F1 | EM | F1 | EM | F1 |
+| GraphRAG | 0 |  | 0 |  | 0 |  | 0 |  |
+| LightRAG | 0 |  | 0 |  | 0 |  | 0 |  |
+| KAG TechReport<br/>(DeepSeek-V2 API) | 62.5 | 76.2 | 67.8 | 76.2 | 36.7 | 48.7 | 55.6 | 67.0 |
+| V0.6<br/>(DeepSeek-V2.5 API) | 60.9 | 75.4 | 69.6 | 78.6 | 36.1 | 48.2 | 55.5 | 67.4 |
+
+
+Note: The metrics for the KAG technical report are taken from Table 10 of that report.
+
+# Performance on query-focused summarization (QFS) tasks
++ **Dataset**
+
+To evaluate the performance of KAG on query-focused summarization tasks, we compared the outputs of KAG and LightRAG on the [UltraDomain](https://huggingface.co/datasets/TommyChien/UltraDomain/tree/main) **cs.jsonl** dataset. For details, see [KAG Example: CSQA](https://github.com/OpenSPG/KAG/tree/master/kag/examples/csqa).
+
+The **cs.jsonl** file contains 10 documents from the field of Computer Science, along with 100 questions and their corresponding answers based on these documents. Unlike the comparison method in the LightRAG paper, we evaluated with the questions provided in the **cs.jsonl** file rather than with questions generated by a large language model. Additionally, when calculating the Factual Correctness metric, we used the answers in the **cs.jsonl** file as the ground truth.
+
++ **Quantitative evaluation results**
+
+| | **KAG** | **LightRAG** |
+| --- | --- | --- |
+| Comprehensiveness (0~10) | 7.57 | 8.87 |
+| Diversity (0~10) | 6.87 | 8.28 |
+| Empowerment (0~10) | 7.54 | 8.53 |
+| Factual Correctness (0~1) | 0.365 | 0.352 |
+| Construction time consumption | 4800 seconds | 3400 seconds |
+| Construction token consumption | 7,006 K | 4,428 K |
+| Basic experiment configuration | generative model: deepseek-chat<br/>representational model: bge-m3<br/>concurrency:<br/>50 (num_threads_per_chain=50, num_chains=16)<br/>chunk size:<br/>(split_length=4950, window_length=100) | generative model: deepseek-chat<br/>representational model: bge-m3<br/>concurrency:<br/>50 (llm_model_max_async=50, embedding_func_max_async=50)<br/>chunk size:<br/>(chunk_token_size=1200, chunk_overlap_token_size=100) |
+
+
++ **Metric Interpretation**
+
+In this release, KAG's summarization metrics improved over the previous version, but a gap to LightRAG remains on Comprehensiveness, Diversity, and Empowerment. In addition, to support both summary generation and factual question answering, knowledge extraction does more work, which increases token consumption. We will continue to optimize this in future versions.
+
+EM and F1 are not shown in the table: when we used the HotpotQA dataset for construction and evaluation, the outputs of LightRAG, GraphRAG, and KAG (using the optimized prompt) all yielded an EM of 0 and an F1 close to 0 against the HotpotQA evaluation set. We therefore believe that EM and F1 are not suitable for evaluating the outputs of summarization tasks.
+
+
+
+We also acknowledge that the four evaluation metrics in the table are not perfect.
+
++ **Comprehensiveness**, **Diversity**, and **Empowerment** are sensitive to the order in which the answers are presented during evaluation. Please refer to this [issue](https://github.com/HKUDS/LightRAG/issues/438) and the LightRAG paper.
++ **Factual Correctness** depends on large language models, so its output is unstable. We also experimented with the Factual Correctness calculation method from RAGAS, but it was even less stable than the method shown in the CSQA example.
+
+If you know of better evaluation methods, feedback is welcome.
+
+
+
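The EM and F1 figures reported in benchmark.md above can be illustrated with a short sketch. It uses the SQuAD-style definitions that are common for HotpotQA-type evaluation (normalize both strings, then compare exactly for EM and by token overlap for F1). This is an assumption: the diff does not show KAG's actual evaluation script, and the function names `normalize`, `exact_match`, and `f1_score` are hypothetical.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def f1_score(prediction: str, gold: str) -> float:
    """Token-level F1 between the normalized prediction and gold answer."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # A short factual answer versus a sentence-length answer to the same question.
    print(exact_match("The Eiffel Tower", "eiffel tower"))
    print(f1_score("Paris is the capital of France.", "Paris"))
```

Under these definitions a sentence-length answer cannot match a short gold answer exactly, and its precision falls as it grows longer, which is consistent with the observation in benchmark.md that summarization-style outputs score an EM of 0 and an F1 close to 0.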
@@ -1,5 +1,5 @@
 {
-  "label": "Tutorial - Extras",
+  "label": "Examples",
   "position": 3,
   "link": {
     "type": "generated-index"