This repository has been archived by the owner on Jul 3, 2023. It is now read-only.

Acamedics

Shiny or similar development for AcaMedics

Path to Data Science (Short Version)

Below is a comprehensive path to learning data science, focussed around the R statistical programming language. We chose R because it's powerful, free and has a thriving online community. We've tried a lot of different resources and have listed those we found most useful. Everyone is different, so if you found something particularly useful, let us know so we can add it to the list.

There is no shortcut to advancing in this field. Find a project, something that you feel passionate about, and get stuck in; learn by doing. A fantastic resource to start with is the tidyverse and R for Data Science, available for free online here: http://r4ds.had.co.nz

Skill development (The Long Version)

Below is a list of some areas that are important to develop on your journey into data science. These are aimed at someone who is interested and motivated but comes from a clinical background, and so is not formally trained in these methods. I have included links to Amazon for books, but mostly just for reference. Ask for a link to a shared Dropbox folder with electronic materials for the majority of them; plenty of others are published online for free. We also keep other copies in the lab, which we are happy for you to borrow (please ask first).

Maths

  • The foundation of data science is rooted in probability and statistics. For clinicians this can be extremely limiting in the long run, as we typically have at most an A-level in maths (often from a very long time ago). Key areas to focus on are:
    • Calculus
    • Probability
    • Linear algebra
    • Set theory and statistical notation
  • Once you have these tools, you will be able to understand the language of statistics and make the most of your data (... and get away from hypothesis testing!). All that is needed here is a conceptual overview of how these things work.

The following resources are extremely useful:

Other resources for statistics:

Online Courses

There has been an explosion in the availability of online courses. They vary from free to expensive, and their quality is often unrelated to cost. Here we compile a list of the best:

| Name | Subject | Website | Cost | Rating |
| --- | --- | --- | --- | --- |
| Essence of Linear Algebra | Linear Algebra | https://www.youtube.com/channel/UCYO_jab_esuFRV4b17AJtAw/playlists | Free | 5/5 |
| The World of Maths | General Maths at all levels | www.khanacademy.com | Free | 5/5 |
| Machine Learning - Coursera | ML | https://www.coursera.org/learn/machine-learning | £58 | Recommended |
| Maths for Machine Learning | Linear Algebra | https://www.coursera.org/learn/linear-algebra-machine-learning/home/welcome | £48 | 4/5 |

R

Python

  • We need some recommendations here

UNIX and VIM

Lab Meetings

  • We meet every Monday morning to discuss ongoing projects. Everyone is welcome to attend, whether to watch or to present. No matter the size, scale or complexity, we would encourage you to present any work as a way to crowdsource solutions to problems and figure out what the next step might be.