-
Notifications
You must be signed in to change notification settings - Fork 4
/
Copy pathTaxonomy.html
216 lines (215 loc) · 9.78 KB
/
Taxonomy.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>CRAN Task View: Taxonomy</title>
<link rel="stylesheet" type="text/css" href="../CRAN_web.css">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<meta name="citation_title" content="CRAN Task View: Taxonomy">
<meta name="citation_author" content="Scott Chamberlain">
<meta name="citation_publication_date" content="2020-11-11">
<meta name="citation_public_url" content="https://CRAN.R-project.org/view=Taxonomy">
<meta name="DC.title" content="CRAN Task View: Taxonomy">
<meta name="DC.creator" content="Scott Chamberlain">
<meta name="DC.issued" content="2020-11-11">
<meta name="DC.identifier" content="https://CRAN.R-project.org/view=Taxonomy">
</head>
<body>
<h2>Taxonomy</h2>
<div>
<p>
This article is about taxonomy in R.
</p>
<p>
If you have any comments or suggestions for additions or improvements for this article
<a href="https://github.com/ropensci/taxonomy/issues">
submit an issue
</a>, or make some changes and
<a href="https://github.com/ropensci/taxonomy/pulls">
submit a pull request
</a>. If you have an issue with one of the packages discussed below, please contact the maintainer of that package.
</p>
<h2 id="taxonomic-data">
Taxonomic Data
</h2>
<p>
The following packages mostly deal with getting taxonomic data from the web onto your machine.
</p>
<ul>
<li>
<a href="../packages/taxize/index.html">taxize</a>
access to 20ish sources of taxonomic data sources. This is the place to go for most taxononomic data.
<a href="../packages/taxize/index.html">taxize</a>
connects to a lot of data sources, and has consistent data outputs across the data sources. In addition, there’s operations that a user wants to do that are consistent across data sources, hiding the gory details of each data source. The only caveat is that if you have a really slow internet connection or you are dealing with A LOT of names, then you may want to use
<a href="../packages/taxizedb/index.html">taxizedb</a>.
</li>
<li>
<a href="../packages/taxizedb/index.html">taxizedb</a>
came out of the
<a href="../packages/taxize/index.html">taxize</a>
package, with sights set on helping people work with larger taxonomic data sets, or that perhaps have infrequent access to the internet.
<a href="../packages/taxizedb/index.html">taxizedb</a>
downloads taxonomic database dumps from many different providers - and makes it easy to either query them with SQL or plug into
<tt>dplyr</tt>
package to use that interface.
<a href="../packages/taxizedb/index.html">taxizedb</a>
is starting to gain some of the functionality of
<a href="../packages/taxize/index.html">taxize</a>
(e.g., get a taxonomic classification) but without having to do web requests, and doing large set of them much faster.
</li>
<li>
<a href="../packages/ritis/index.html">ritis</a>
is a client for the Integrated Taxonomic Information System (ITIS) database of taxonomic data. ITIS is run by USGS in the US. It provides access to ITIS’s Solr web service (think for search), as well as their REST API more for fetching data for specific things by name or taxonomic ID.
</li>
<li>
<a href="../packages/wikitaxa/index.html">wikitaxa</a>
is a client for Wikipedia, Wikicommons, Wikspecies, and Wikidata taxonomic data. It is sometimes a bit odd since there is no interface specifically for taxonomic data besides Wikispecies, so you can end up with results that are not really taxonomic as well - but we do our best.
</li>
<li>
<a href="../packages/worrms/index.html">worrms</a>
client for the
<a href="http://www.marinespecies.org/">
WoRMS (World Register of Marine Species)
</a>
API. Contains mostly taxonomic data, but also trait data.
</li>
<li>
<a href="../packages/Taxonstand/index.html">Taxonstand</a>
data from The Plant List. This package takes in plant taxon names, and downloads CSV files from The Plant List website, then presents those as a data.frame within R.
</li>
</ul>
<h2 id="taxonomic-classes">
Taxonomic Classes
</h2>
<p>
The
<a href="../packages/taxa/index.html">taxa</a>
package defines a set of taxonomic objects (using S3 and R6) for both use cases where only taxonomic data is of interest, as well as when one has taxonomic data combined with other data on the taxa.
<a href="../packages/taxa/index.html">taxa</a>
aims to form the basis upon which other taxonomic packages can be built, using common classes.
</p>
<h2 id="manipulatingparsing-taxonomic-names">
Manipulating/Parsing Taxonomic Names
</h2>
<p>
The following packages don’t fetch taxonomic dat as those in the above section, but are focused around providing tooling around taxonomic names and data.
</p>
<ul>
<li>
<a href="../packages/rgnparser/index.html">rgnparser</a>
is a thin client for the Go based library
<a href="https://gitlab.com/gogna/gnparser">
gnparser
</a>
from the GlobalNamesArchitecture project, which uses a Parsing Expression Grammar (PEG) to parse taxonomic names very quickly.
</li>
<li>
<a href="../packages/metacoder/index.html">metacoder</a>
specializes in metabarcoding. It can parse, manipulate, and visualize metabarcoding/taxonomic data. It leverages the
<a href="../packages/taxa/index.html">taxa</a>
package (
<a href="https://github.com/grunwaldlab/metacoder">
GitHub
</a>)
</li>
<li>
<a href="https://github.com/ropensci/taxview">
taxview
</a>
is a in development package to help users summarize taxonomic data. Summarizing so far is taking form of getting taxonomic hierarchy data (via the
<a href="../packages/taxize/index.html">taxize</a>
package) to facilitate taxonomic summaries. Visualization isn’t done yet.
</li>
</ul>
<h2 id="handling-taxonomic-name-lists">
Handling Taxonomic Name Lists
</h2>
<p>
There are a few packages that deal specifically with handling species lists:
</p>
<ul>
<li>
<a href="https://github.com/sckott/splister">
splister
</a>
- match species list against a reference list. Still in development
</li>
</ul>
<h2 id="ncbi-data">
NCBI data
</h2>
<ul>
<li>
NCBI taxonomic data is available from a number of R packages.
<a href="../packages/taxizedb/index.html">taxizedb</a>
provides access to local version of NCBI’s taxonomy - in addition to taxonomies for other data sources (see above).
<a href="../packages/ncbit/index.html">ncbit</a>
makes NCBI taxonomic data locally available and searchable as an R object (a
<tt>data.frame</tt>). The package comes with a version from 2013, but you can choose to update it; although updating it still seemed to use the old version from 2013.
<a href="../packages/taxonomizr/index.html">taxonomizr</a>
contains functions for assigning taxonomy to NCBI accession numbers and taxon IDs based on NCBI’s
<tt>accession2taxid</tt>
and
<tt>taxdump</tt>
files.
</li>
<li>
<a href="../packages/microclass/index.html">microclass</a>
has functions for assigning 16S sequence data to a taxonomic level in the tree-of-life for prokaryotes.
</li>
</ul>
<h2 id="specialized-packages">
Specialized packages
</h2>
<ul>
<li>
<a href="../packages/monographaR/index.html">monographaR</a>
functions to facilitate the production of plant taxonomic monographs
</li>
<li>
<a href="../packages/taxlist/index.html">taxlist</a>
has functions to import species lists from
<a href="https://www.synbiosys.alterra.nl/turboveg/">
Turboveg
</a>, a database management system for vegetation data
</li>
<li>
<a href="../packages/vegdata/index.html">vegdata</a>
has functions to get taxonomic data from
<a href="https://www.synbiosys.alterra.nl/turboveg/">
Turboveg
</a>, and
<a href="https://www.vegetweb.de/">
VegetWeb
</a>
(German plant data)
</li>
</ul>
</div>
<h3>CRAN packages:</h3>
<ul>
<li><a href="../packages/metacoder/index.html">metacoder</a></li>
<li><a href="../packages/microclass/index.html">microclass</a></li>
<li><a href="../packages/monographaR/index.html">monographaR</a></li>
<li><a href="../packages/ncbit/index.html">ncbit</a></li>
<li><a href="../packages/rgnparser/index.html">rgnparser</a></li>
<li><a href="../packages/ritis/index.html">ritis</a></li>
<li>
<a href="../packages/taxa/index.html">taxa</a> (core)</li>
<li>
<a href="../packages/taxize/index.html">taxize</a> (core)</li>
<li><a href="../packages/taxizedb/index.html">taxizedb</a></li>
<li><a href="../packages/taxlist/index.html">taxlist</a></li>
<li><a href="../packages/taxonomizr/index.html">taxonomizr</a></li>
<li><a href="../packages/Taxonstand/index.html">Taxonstand</a></li>
<li><a href="../packages/vegdata/index.html">vegdata</a></li>
<li><a href="../packages/wikitaxa/index.html">wikitaxa</a></li>
<li><a href="../packages/worrms/index.html">worrms</a></li>
</ul>
<h3>Related links:</h3>
<ul>
<li><a href="https://cran.rstudio.com/web/views/Phylogenetics.html">
Phylogenetics Task View
</a></li>
</ul>
</body>
</html>