Documentation

Conventions

dataset-name_time-granularity_grouping_year.extension

Descriptors

Examples:

Functions and prior scripts should be referenced at the top of every file.
Each file should do one thing.
- Functions or complicated cleaning or scraping should be put into its own script.
  - Often is a .R or .PY script
- There can be a file that calls or references previous files. (One file to run it all or to produce final outputs)
  - Often is a .RMD or .IPYNB or bash file
Avoid hard-coding values into your code.
Recommend DRY code. Turn repeated code into a function that takes the changes as parameters.
Files that depend on other files require a numbered file name.

There are two types of repositories: project and collection.

For project repositories:

Each data request is its own repo.
The Readme, About, and Tags are filled out.
No passwords are ever shown on a repo.
If there is no private data, then repo is destined to be public.
When cleaning up a repo, old, unused code/data/visuals can be placed into an 'archived' called folder.