Add table to list port, endpoint, framework, model, serving, and hardware for each microservice in ChatQnA (#697)

ctao456 · srinarayan-srikanthan · pre-commit-ci[bot] · web-flow · commit 1a934afb3a0b · 2024-09-11T15:45:08.000+08:00
Signed-off-by: srinarayan-srikanthan &lt;srinarayan.srikanthan@intel.com&gt;
Signed-off-by: Chun Tao &lt;chun.tao@intel.com&gt;
Signed-off-by: letonghan &lt;letong.han@intel.com&gt;
Signed-off-by: Ye, Xinyu &lt;xinyu.ye@intel.com&gt;
Signed-off-by: chensuyue &lt;suyue.chen@intel.com&gt;
Signed-off-by: Yue, Wenjiao &lt;wenjiao.yue@intel.com&gt;
Signed-off-by: Lianhao Lu &lt;lianhao.lu@intel.com&gt;
Co-authored-by: srinarayan-srikanthan &lt;srinarayan.srikanthan@intel.com&gt;
Co-authored-by: pre-commit-ci[bot] &lt;66853113+pre-commit-ci[bot]@users.noreply.github.com&gt;
Co-authored-by: Letong Han &lt;106566639+letonghan@users.noreply.github.com&gt;
Co-authored-by: XinyuYe-Intel &lt;xinyu.ye@intel.com&gt;
Co-authored-by: chen, suyue &lt;suyue.chen@intel.com&gt;
Co-authored-by: Zhenzhong1 &lt;109137058+Zhenzhong1@users.noreply.github.com&gt;
Co-authored-by: WenjiaoYue &lt;wenjiao.yue@intel.com&gt;
Co-authored-by: Lianhao Lu &lt;lianhao.lu@intel.com&gt;
Co-authored-by: Ying Hu &lt;ying.hu@intel.com&gt;
diff --git a/ChatQnA/README.md b/ChatQnA/README.md
@@ -97,6 +97,21 @@ flowchart LR
 
 This ChatQnA use case performs RAG using LangChain, Redis VectorDB and Text Generation Inference on Intel Gaudi2 or Intel XEON Scalable Processors. The Intel Gaudi2 accelerator supports both training and inference for deep learning models in particular for LLMs. Visit [Habana AI products](https://habana.ai/products) for more details.
 
+In the below, we provide a table that describes for each microservice component in the ChatQnA architecture, the default configuration of the open source project, hardware, port, and endpoint.
+
+<details>
+<summary><b>Gaudi default compose.yaml</b></summary>
+
+| MicroService | Open Source Project | HW    | Port | Endpoint             |
+| ------------ | ------------------- | ----- | ---- | -------------------- |
+| Embedding    | Langchain           | Xeon  | 6000 | /v1/embaddings       |
+| Retriever    | Langchain, Redis    | Xeon  | 7000 | /v1/retrieval        |
+| Reranking    | Langchain, TEI      | Gaudi | 8000 | /v1/reranking        |
+| LLM          | Langchain, TGI      | Gaudi | 9000 | /v1/chat/completions |
+| Dataprep     | Redis, Langchain    | Xeon  | 6007 | /v1/dataprep         |
+
+</details>
+
 ## Deploy ChatQnA Service
 
 The ChatQnA service can be effortlessly deployed on either Intel Gaudi2 or Intel XEON Scalable Processors.