Commit

Update FEATURE_COMPARISON.md (#215)
Full HTML extraction is available in latest
navarone-feekery authored Feb 21, 2025
1 parent e1e1ab2 commit 12e145e
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/FEATURE_COMPARISON.md
@@ -19,7 +19,7 @@ The following table compares the features of Open Crawler to those of Elastic Cr
| Crawler directives — `robots.txt`, sitemaps, robots meta tags, canonical URLs, nofollow links | [Yes](./features/CRAWLER_DIRECTIVES.md) | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-content.html) |
| Scheduling | [Yes](../README.md#scheduling-recurring-crawl-jobs) | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-schedule) |
| Extraction using data attributes and meta tags | No, _planned for `v0.3`_ | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-content.html#crawler-content-meta-tags-content-extraction) |
-| Full HTML extraction | No, _planned for `v0.3`_ | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-html-storagedocuments) |
+| Full HTML extraction | Yes | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-html-storagedocuments) |
| Event logging in Elasticsearch | No, _planned for `v0.3`_ | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-view-events-logs.html) |
| Duplicate content handling | No | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-duplicate-documents) |
| Crawl result history and metadata | No | Yes |