This repository has been archived by the owner on Apr 28, 2024. It is now read-only.

Appears to be incompatible with the Ingest Attachment plugin #6

Open
falxen opened this issue Aug 29, 2022 · 1 comment

Comments

falxen commented Aug 29, 2022

When the Ingest Attachment plugin is used to extract text from a PDF file, inelastic can't process the extracted content. For example, after ingesting a PDF file that contains only the single word "Testing", inelastic responds as follows:

inelastic -i index_name -f data -o csv

term,freq,doc_count,d0
08dk3jkl4n,1,1,3306e7fb05ba68fa
0g4f7pxvs9xivfdavfvge,1,1,3306e7fb05ba68fa
0nhjxt,1,1,3306e7fb05ba68fa
119gxv8pfpi7,1,1,3306e7fb05ba68fa
16fjzozxv14lkxhyc9sbjjioi63lu9onr5jro8orqzogmbuozccllycdf3f9n8tgycb3spqxzbqquu7ipst6plptdtuzaxobwiwxrxbplz8xkh77ep9o14,1,1,3306e7fb05ba68fa
1d1cogrcdfe,1,1,3306e7fb05ba68fa
1ns12pfuhed1funs7a73dc5f1yy4xlqt2dew9wvbxesmc,1,1,3306e7fb05ba68fa
1yynr9kbmzebhrgm2cw6ukc00ogxed4z1qplbmrzdhjlyw0kzw5kb2jqcgozidagb2jqcjewnaplbmrvymokcjugmcbvymokpdwvtgvuz3roidygmcbsl0zpbhrlci9gbgf0zurly29kzs9mzw5ndggxideynzq0pj4kc3ryzwftcnic5vonfftvlb9vzrxjmkkmmyejibbekgb5mewgieiknznkkozky5kjx1xkzeylgcjmm84hawkklyuogns,1,1,3306e7fb05ba68fa
2,1,1,3306e7fb05ba68fa
26ci7mjisgccupt87usjg8pvcr0m5j1i7nf1j22mj2mjzqvvh6e2x8ef6tdrlwnpnz9934qm6pfbkzrw9pm,1,1,3306e7fb05ba68fa
2sbfyxwtqrfohxvw9cppe3ha4xwfiunn7cit9baexb46l6b,1,1,3306e7fb05ba68fa
3044uoxcal,1,1,3306e7fb05ba68fa
348z,1,1,3306e7fb05ba68fa
38mhj4uemx7,1,1,3306e7fb05ba68fa
39m,1,1,3306e7fb05ba68fa
3myfv7se55,1,1,3306e7fb05ba68fa

(I truncated the response. There are hundreds of lines like this, instead of a single term for the one word "Testing".)
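
One observation that may help narrow this down: the long terms look like lowercased base64 rather than extracted text. The sketch below takes a substring from one of the terms above and compares it with the base64 encoding of some ordinary PDF object syntax. The `pdf_syntax` bytes are a guess made for illustration, not bytes read from the actual file. If the comparison holds, the `data` field passed with `-f` probably contains the base64-encoded PDF source, not the text the plugin extracted. The attachment processor writes extracted text to `attachment.content` by default, so that field may be worth trying with `-f` (untested here).

```python
import base64

# Substring copied from one of the long terms in the output above.
reported_fragment = "zw5kb2jqcgozidagb2jqcjewnaplbmrvymok"

# ASSUMPTION: a guess at the raw bytes behind that fragment, namely
# ordinary PDF object syntax. This is not taken from the real file.
pdf_syntax = b"endobj\n\n3 0 obj\n104\nendobj\n"

# Base64-encode the guessed bytes, as the attachment pipeline expects
# the source document to be supplied.
encoded = base64.b64encode(pdf_syntax).decode("ascii")

# Analyzers typically lowercase terms, so compare case-insensitively.
matches = encoded.lower() == reported_fragment

print(encoded)
print(matches)
```

Lowercasing discards information, so the terms themselves cannot be decoded back into the PDF. The comparison only works in the forward direction shown here.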

Any hints, please?

@federicotdn
Owner

Hi! Unfortunately it's been some years since I've used Elasticsearch, so I can't help in this case.
