@@ -55,19 +55,61 @@ eggNOG database version 6.0 orthomaps
Extracted orthomaps for all Eukaryota from `eggNOG database version 6.0 <http://eggnog6.embl.de/#/app/home>`_ can be downloaded here:

- `eggnog6_eukaryota_orthomaps.tsv.zip <https://zenodo.org/record/8360098/files/eggnog6_eukaryota_orthomaps.tsv.zip>`_
+ `eggnog6_eukaryota_orthomaps.tsv.zip <https://zenodo.org/records/14911022/files/eggnog6_eukaryota_orthomaps.tsv.zip>`_
+
+ .. code-block:: bash
+
+     # to download the eggNOG 6.0 Eukaryota orthomaps on Linux run:
+     wget https://zenodo.org/records/14911022/files/eggnog6_eukaryota_orthomaps.tsv.zip
+
+     # on Mac:
+     curl https://zenodo.org/records/14911022/files/eggnog6_eukaryota_orthomaps.tsv.zip --remote-name


To get an orthomap for e.g. the species *Caenorhabditis elegans* (taxID: 6239):

- ::
+ .. code-block:: python

-     from oggmap import qlin, gtf2t2g, of2orthomap, orthomap2tei, datasets
    import pandas as pd
+     from oggmap import qlin, gtf2t2g, of2orthomap, orthomap2tei, datasets, ncbitax
    eggnog6_eukaryota_orthomaps = pd.read_csv('eggnog6_eukaryota_orthomaps.tsv.zip', delimiter='\t')
-     query_lineage = qlin.get_qlin(q='Caenorhabditis elegans')
+     query_lineage = qlin.get_qlin(q='Caenorhabditis elegans', dbname='taxadb.sqlite')
+
+ .. code-block:: console
+
+     >>> query name: Caenorhabditis elegans
+     query taxID: 6239
+     query kingdom: Eukaryota
+     query lineage names:
+     ['root(1)', 'cellular organisms(131567)', 'Eukaryota(2759)', 'Opisthokonta(33154)',
+     'Metazoa(33208)', 'Eumetazoa(6072)', 'Bilateria(33213)', 'Protostomia(33317)',
+     'Ecdysozoa(1206794)', 'Nematoda(6231)', 'Chromadorea(119089)', 'Rhabditida(6236)',
+     'Rhabditina(2301116)', 'Rhabditomorpha(2301119)', 'Rhabditoidea(55879)',
+     'Rhabditidae(6243)', 'Peloderinae(55885)', 'Caenorhabditis(6237)', 'Caenorhabditis elegans(6239)']
+     query lineage:
+     [1, 131567, 2759, 33154, 33208, 6072, 33213, 33317, 1206794, 6231, 119089, 6236,
+     2301116, 2301119, 55879, 6243, 55885, 6237, 6239]
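The two lists printed above describe the same path from the root to the species, once as ``Name(taxID)`` strings and once as bare taxIDs. The following is a minimal sketch, not part of oggmap, of how each entry could be split and given a position along that path. The helper ``split_name_and_taxid`` and the dictionary ``rank_by_taxid`` are hypothetical names introduced only for this illustration.

```python
import re

# Lineage names copied from the printed output for Caenorhabditis elegans.
lineage_names = [
    'root(1)', 'cellular organisms(131567)', 'Eukaryota(2759)', 'Opisthokonta(33154)',
    'Metazoa(33208)', 'Eumetazoa(6072)', 'Bilateria(33213)', 'Protostomia(33317)',
    'Ecdysozoa(1206794)', 'Nematoda(6231)', 'Chromadorea(119089)', 'Rhabditida(6236)',
    'Rhabditina(2301116)', 'Rhabditomorpha(2301119)', 'Rhabditoidea(55879)',
    'Rhabditidae(6243)', 'Peloderinae(55885)', 'Caenorhabditis(6237)',
    'Caenorhabditis elegans(6239)',
]


def split_name_and_taxid(entry):
    """Split 'Name(taxID)' into (name, taxID). Hypothetical helper, not an oggmap function."""
    match = re.fullmatch(r'(.+)\((\d+)\)', entry)
    if match is None:
        raise ValueError(f'unexpected lineage entry: {entry!r}')
    return match.group(1), int(match.group(2))


# One (name, taxID) pair per lineage node, ordered from root to species.
pairs = [split_name_and_taxid(entry) for entry in lineage_names]

# Position of every taxID in the lineage: 0 is the root, the last index is the species.
rank_by_taxid = {taxid: rank for rank, (_, taxid) in enumerate(pairs)}

print(rank_by_taxid[6231])  # position of Nematoda in the lineage
```

A lookup like this can be used to order the ``PStaxID`` values of an orthomap from the oldest node to the youngest, assuming they all occur in the query lineage.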
+
+ .. code-block:: python
+
    query_orthomap = eggnog6_eukaryota_orthomaps[eggnog6_eukaryota_orthomaps['taxID'] == query_lineage[1]]
    query_orthomap

+ .. code-block:: console
+
+     >>>       taxID                    name             seqID  ...  PStaxID              PSname  PScontinuity
+     13301320   6239  Caenorhabditis elegans   6239.C55B7.6a.1  ...   131567  cellular organisms           1.0
+     13301321   6239  Caenorhabditis elegans   6239.F14D12.5.1  ...   131567  cellular organisms           1.0
+     13301322   6239  Caenorhabditis elegans    6239.F41D9.5.1  ...   131567  cellular organisms           1.0
+     13301323   6239  Caenorhabditis elegans   6239.K12G11.1.1  ...   131567  cellular organisms           1.0
+     13301324   6239  Caenorhabditis elegans   6239.K12G11.2.1  ...   131567  cellular organisms           1.0
+     ...         ...                     ...               ...  ...      ...                 ...           ...
+     13319237   6239  Caenorhabditis elegans   6239.R09E12.8.1  ...     6237      Caenorhabditis           1.0
+     13319238   6239  Caenorhabditis elegans    6239.F39H2.1.1  ...   119089         Chromadorea           1.0
+     13319239   6239  Caenorhabditis elegans    6239.C32D5.9.1  ...     2759           Eukaryota           1.0
+     13319240   6239  Caenorhabditis elegans   6239.ZK593.6a.1  ...     2759           Eukaryota           1.0
+     13319241   6239  Caenorhabditis elegans  6239.F29C12.3b.1  ...     6231            Nematoda           1.0
+
+     [17922 rows x 8 columns]
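The load-and-filter step above can be tried without downloading the full archive. The sketch below writes a tiny zipped TSV to a temporary directory, reads it back with ``pd.read_csv``, selects one species by ``taxID`` and counts genes per phylostratum. It is an illustration under stated assumptions: the file name ``toy_orthomaps.tsv.zip`` is made up, only six of the eight columns are included (the two hidden behind ``...`` in the output are unknown here), the *C. elegans* rows are copied from the output above, and the *Mus musculus* row is invented.

```python
import os
import tempfile
import zipfile

import pandas as pd

# Toy table: the 6239 rows are copied from the printed output above,
# the 10090 row is invented so that the filter has something to exclude.
toy_tsv = (
    "taxID\tname\tseqID\tPStaxID\tPSname\tPScontinuity\n"
    "6239\tCaenorhabditis elegans\t6239.C55B7.6a.1\t131567\tcellular organisms\t1.0\n"
    "6239\tCaenorhabditis elegans\t6239.F14D12.5.1\t131567\tcellular organisms\t1.0\n"
    "6239\tCaenorhabditis elegans\t6239.F39H2.1.1\t119089\tChromadorea\t1.0\n"
    "6239\tCaenorhabditis elegans\t6239.C32D5.9.1\t2759\tEukaryota\t1.0\n"
    "6239\tCaenorhabditis elegans\t6239.F29C12.3b.1\t6231\tNematoda\t1.0\n"
    "10090\tMus musculus\t10090.hypothetical_gene\t2759\tEukaryota\t1.0\n"
)

with tempfile.TemporaryDirectory() as tmpdir:
    # Hypothetical file name; pandas infers zip compression from the extension.
    zip_path = os.path.join(tmpdir, "toy_orthomaps.tsv.zip")
    with zipfile.ZipFile(zip_path, "w") as archive:
        archive.writestr("toy_orthomaps.tsv", toy_tsv)
    orthomaps = pd.read_csv(zip_path, sep="\t")

# Keep only the rows of the query species (taxID 6239, Caenorhabditis elegans).
query_taxid = 6239
query_orthomap = orthomaps[orthomaps["taxID"] == query_taxid]

# Number of genes assigned to each phylostratum.
ps_counts = query_orthomap.groupby(["PStaxID", "PSname"]).size()
print(ps_counts)
```

With the real archive, the same ``groupby`` call on ``query_orthomap`` should give the number of genes per phylostratum for the chosen species.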

.. _tutorial-pre-calculated-orthomaps-plaza: