You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A command-line tool to concatenate a directory full of files into a single prompt for use with Large Language Models (LLMs).
6
+
A command-line tool to concatenate files and directories in a structured way to a single prompt for use with large language models and other applications.
7
7
8
8
## Description
9
9
10
-
`files-to-prompt.ts` is a stand-alone, dependency free[^1] script that allows you to combine multiple text or code files into a single, continuous stream of content. This can be useful when working with LLMs, where you may want to provide a comprehensive set of information as input to the model, rather than individual files.
11
-
12
-
The tool supports processing both individual files and entire directories, and provides options to include or exclude hidden files, ignore `.gitignore` rules, and specify custom patterns to ignore. Only a simple subset of `.gitignore` patterns are supported.
10
+
`files-to-prompt.ts` is a stand-alone, dependency free[^1] script that combines multiple text or code files into a single, continuous stream of content. This can be useful when working with LLMs, where you may want to provide a comprehensive set of information as input to the model, rather than individual files.
13
11
14
12
This is a TypeScript port of the original `files-to-prompt` tool written in Python by Simon Willison, which is available at [https://github.com/simonw/files-to-prompt](https://github.com/simonw/files-to-prompt).
15
13
16
-
Additional features not currently found the original version:
17
-
- reading and parsing file paths received via `stdin` (e.g. via pipe from `grep` or `ripgrep`)
18
-
- redirect output to a file with `--output <file>` or `-o <file>`
19
-
- convert Jupyter Notebook `*.ipynb` files on the fly to ascii or markdown with [nbconvert](https://nbconvert.readthedocs.io/en/latest/index.html)
14
+
[^1]: The script needs a TypeScript engine to run, of course.
15
+
16
+
## Features
20
17
21
-
[^1]: The script needs a Typescript engine to run, of course.
18
+
- Extracts the contents of text files and presents them in a formatted way
19
+
- Supports including or excluding hidden files and directories
20
+
- Allows ignoring files based on patterns or .gitignore rules
21
+
- Converts Jupyter Notebook (.ipynb) files to ASCII or Markdown
22
+
- Supports redirecting output to a file
23
+
- Runs on the Bun runtime environment
22
24
23
25
## Installation
24
26
25
-
1.To use `files-to-prompt.ts`out-of-the-box, you'll need to have [Bun](https://bun.sh/) installed on your system.
27
+
To use the `files-to-prompt.ts`tool, you'll need to have the Bun TypeScript engine installed on your system. You can download and install Bun from the official website: [https://bun.sh/](https://bun.sh/)
3. Make the script executable with `chmod +x ftp.ts`
40
-
4. Move `ftp.ts` to a location where it is accessible from your system's `$PATH` (optional)
47
+
Don't forget to make the script executable with `chmod +x ftp.ts` and move it to a location where it is accessible from your system's `$PATH` (optional).
41
48
42
49
## Usage
43
50
44
-
Run `ftp.ts` like a standard command:
51
+
### Command-line Arguments
52
+
53
+
The tool accepts the following command-line arguments:
45
54
46
55
```
47
-
ftp.ts <paths...> [options]
56
+
--version Output the version of the tool
57
+
--include-hidden Include hidden files and directories
58
+
--ignore-gitignore Ignore .gitignore files
59
+
-i, --ignore <pattern> Ignore files matching the specified pattern
60
+
-o, --output <file> Redirect output to the specified file
61
+
--nbconvert <command> Specify the nbconvert command to use for .ipynb files
62
+
--format <format> Specify the format to convert .ipynb files to ('asciidoc' or 'markdown')
63
+
[paths] One or more file or directory paths to process
48
64
```
49
65
50
-
Replace `<paths...>` with one or more paths to files or directories you want to process. The tool will recursively process all files within the specified directories.
66
+
### Input from Stdin
51
67
52
-
Available options:
68
+
The tool can also accept file paths from stdin. The input can be in the following formats:
53
69
54
-
- `--version`: Version of this script
55
-
- `--include-hidden`: Include files and folders starting with `.` (default: `false`)
56
-
- `--ignore-gitignore`: Ignore `.gitignore` files and include all files (default: `false`)
57
-
- `-i, --ignore <pattern>`: Specify one or more patterns to ignore (can be used multiple times)
58
-
- `-o, --output <file>`: Redirect output to `file` (**Note:** `file` will get silently overwritten)
59
-
- `--nbconvert <toolname>`: Name or full path of the installed `nbconvert` tool, e.g. `jupyter-nbconvert`
60
-
- `--format [asciidoc | markdown]`: Output format for the `nbconvert` tool, defaults to `asciidoc`
70
+
- One file path per line
71
+
- File paths with a colon separator, as in the output of grep or ripgrep
61
72
62
-
Jupyter Notebook conversion gets triggered when the `--nbconvert <toolname>` option is set, otherwise `.ipynb` files will get included verbatim in their `JSON` format (including `base64` encoded images etc). Each file conversion will happen in a freshly created temporary directory which will get deleted after each conversion.
73
+
### Output Configuration
63
74
64
-
Example usage:
75
+
By default, the tool outputs the file contents to the console. You can redirect the output to a file using the `--output` or `-o` option.
76
+
77
+
### Notebook Conversion
78
+
79
+
If the `--nbconvert` option is provided, the tool will attempt to convert any `.ipynb` (Jupyter Notebook) files to the specified format (`--format asciidoc` or `--format markdown`) using the provided `nbconvert` command name or path. By default `asciidoc` will be used.
80
+
81
+
Without conversion (or if the provided command cannot be found) `.ipynb` files will get included verbatim in their `JSON` format (including `base64` encoded images etc).
82
+
83
+
Internally each file conversion happens in a freshly created temporary directory (via `mktemp` API), which gets deleted automatically after successful (or unsucessful) completion.
llm -s "Update README.md to current state of the project"
89
+
llm -s "Check the provided code for spelling errors, inconsistencies, dead code and other minor issues I might have overlooked."
69
90
```
70
91
71
-
This will concatenate all files (including hidden files) in the `./my-project` directory, excluding any files matching the `*.lockdb` or `*.env` patterns, and then send the result to the [llm](https://llm.datasette.io/en/stable/) command, which adds a system prompt and sends it to an LLM model for processing.
92
+
This will concatenate all files (including hidden files) in the `./my-project` directory, excluding any files matching the `*.lockdb` or `*.env` patterns, and then send the result to the [llm](https://llm.datasette.io/en/stable/) command, which adds a system prompt and sends it to a LLM API for processing.
> **Note:** Warnings and errors will get sent to `stderr`, piping the output to another command is safe
105
+
106
+
> **Note:** Warnings and errors will get sent to `stderr`, piping the output to another command is safe. The script tries its best to keep the output stream clean.
85
107
86
108
## Testing
87
109
88
-
This tool includes a set of tests to ensure it works as expected. You can run the tests using the following command:
110
+
This repository includes a comprehensive test script to ensure `files-to-prompt.ts` works as expected. You can run the tests using the following command:
0 commit comments