You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
DEFAULT_PROMPT="Please analyze this image and provide a markdown representation of its content. Include headings, lists, and any other relevant markdown formatting."
12
+
DEFAULT_PROMPT="""
13
+
Extract the full markdown text from the given image, following these guidelines:
14
+
- Respond only with markdown, no additional commentary.
15
+
- Capture all the text, respecting titles, headers, subheaders, equations, etc.
16
+
- If there are tables in this page, convert each one into markdown table format and include it in the response.
17
+
- If there are images, provide a brief description of what is shown in each image, and include it in the response.
18
+
- if there are charts, for each chart include a markdown table with the data represents the chart, a column for each of the variables of the cart and the relevant estimated values
base_url (str): You can use this one to point the client whereever you need it like Ollama
111
120
prompt (str, optional): The prompt to send to the API. Defaults to DEFAULT_PROMPT.
112
-
121
+
pages_list (list, optional): A list of page numbers to process. If provided, only these pages will be converted. Defaults to None, which processes all pages.
113
122
Returns:
114
123
list: A list of strings, each containing the markdown representation of a PDF page.
0 commit comments