StatsCanPy

Basic package for querying & downloading StatsCan data by table name. Saves data into a dataframe (Pandas or PySpark).

Allows for querying datasets via plain text search or table ID.

Installation

pip install statscanpy

Usage

Basic Usage

  from statscanpy import StatsCanPy

  # if isSpark==True, data returns will be in PySpark; otherwise it will return as a pandas.DataFrame
  statscan = StatsCanPy(path="/data/saved/here", isSpark=True)

Getting Table ID from Table Name

  statscan.get_table_id_from_name("Railway industry operating statistics by mainline companies")
  >>> TOP MATCH:
      Railway industry operating statistics by mainline companies: 23-10-0055-01
      Accessible at: https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=2310005501

Getting Table Data from Table Name

  await statscan.get_table_from_name("Household spending, Canada, regions and provinces")
  >>> TOP MATCH:
      Household spending, Canada, regions and provinces: 11-10-0222-01
      Accessible at: https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=1110022201
      DataFrame[REF_DATE: date, GEO: string, ...]

Searching for Table(s) by String

  statscan.find_table_id_from_name("GDP", limit=15)
  >>> TOP 15 MATCHES:
      1. Gross domestic product (GDP) at basic prices, by industry, monthly, growth rates: 36-10-0434-02
      Accessible at: https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=3610043402
      2. Gross domestic product, expenditure-based, provincial and territorial, annual: 36-10-0222-01
      Accessible at: https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=3610022201
      ...

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
.github/workflows		.github/workflows
src/statscanpy		src/statscanpy
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StatsCanPy

Installation

Usage

Basic Usage

Getting Table ID from Table Name

Getting Table Data from Table Name

Searching for Table(s) by String

Further Reading

About

Releases 10

Packages

Languages

License

deepwaterpaladin/statscanpy

Folders and files

Latest commit

History

Repository files navigation

StatsCanPy

Installation

Usage

Basic Usage

Getting Table ID from Table Name

Getting Table Data from Table Name

Searching for Table(s) by String

Further Reading

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 10

Packages 0

Languages

Packages