diff --git a/docs/FEATURE_COMPARISON.md b/docs/FEATURE_COMPARISON.md index b0cd4ee..24267be 100644 --- a/docs/FEATURE_COMPARISON.md +++ b/docs/FEATURE_COMPARISON.md @@ -19,7 +19,7 @@ The following table compares the features of Open Crawler to those of Elastic Cr | Crawler directives — `robots.txt`, sitemaps, robots meta tags, canonical URLs, nofollow links | [Yes](./features/CRAWLER_DIRECTIVES.md) | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-content.html) | | Scheduling | [Yes](../README.md#scheduling-recurring-crawl-jobs) | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-schedule) | | Extraction using data attributes and meta tags | No, _planned for `v0.3`_ | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-content.html#crawler-content-meta-tags-content-extraction) | -| Full HTML extraction | No, _planned for `v0.3`_ | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-html-storagedocuments) | +| Full HTML extraction | Yes | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-html-storagedocuments) | | Event logging in Elasticsearch | No, _planned for `v0.3`_ | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-view-events-logs.html) | | Duplicate content handling | No | [Yes](https://www.elastic.co/guide/en/enterprise-search/current/crawler-managing.html#crawler-managing-duplicate-documents) | | Crawl result history and metadata | No | Yes |