R Analysis Skeleton

No one beginning a data science project should start from a blinking cursor.
...Templatization is a best practice for things like using common directory structure across projects...
-Megan Risdal Kaggle Product Lead.

This project contains the files and settings commonly used in analysis projects with R. A developer can start an analysis repository more quickly by copying these files. The purpose of each directory is described in its README file. Some aspects are more thoroughly described in Collaborative Data Science Practices.

Pipelines

The repo contains two pipelines that aim to be simple enough to understand, yet complex enough to mimic aspects frequently seen in analysis projects.

Cars

The simplest example involves a csv that is lightly groomed and saved as an rds file. A knitr Rmd file analyzes the rds; the text, graphs, and tables are saved as a self-contained html. The html file is veyr protable; it can be saved on a drive, emailed to a colleauge, or publically served on a website.

Intra-individual Differences

Most nontrivial data science projects require multiple sources to address a single issue. This example uses three sources: (a) longitudinal measurements for individuals across time (mlm.csv), (b) static county characteristics (county.csv), and (c) longintudinal county-level characteristics (te.csv). Each csv is independently groomed and loaded into its own database table (in db.sqlite) by an ellis lane. Conventional statistical software is not designed to digest multiple data rectangles; a scribe transforms multple database-normalized tables into a single rds that can be analyzed directly. In this case, the mlm.rds supports two analyses: a conventional report of statistical inferences intended for subject-experts concerned with complex hypotheses, and a dashboard of simplified patterns intended for administrators concerned with operational progress. The te.rds supports a comparison of the time and effort results between counties.

Name		Name	Last commit message	Last commit date
Latest commit History 325 Commits
analysis		analysis
data-public		data-public
data-unshared		data-unshared
documentation		documentation
manipulation		manipulation
stitched-output		stitched-output
utility		utility
.Rbuildignore		.Rbuildignore
.gitattributes		.gitattributes
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NEWS		NEWS
RAnalysisSkeleton.Rproj		RAnalysisSkeleton.Rproj
README.md		README.md
config.yml		config.yml
flow.R		flow.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

R Analysis Skeleton

Pipelines

Cars

Intra-individual Differences

About

Releases

Packages

Languages

License

xkcococo/RAnalysisSkeleton

Folders and files

Latest commit

History

Repository files navigation

R Analysis Skeleton

Pipelines

Cars

Intra-individual Differences

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages