Introduction: A toolchain for making data products
- What you will learn
- Why write a book for a toolchain?
- Who this book is for
- How we know this toolchain works
- Where do we find data?
- What tools will I need?
- After reading this book, you should feel better about…
- What this book covers
- What this book doesn’t cover
- How this book is structured
- What you’ll walk away with
Part 1: ‘Good enough’ data skills
- Why ‘good enough’?
- How to share your work
- ‘Good enough’ communication
- Get ‘good enough,’ then go for more if you need it
Part 2: “Have a workflow.”
- Principle 1: Use open-source software
- Principle 2: Write code
- Principle 3: Document everything in plain text
- Rmarkdown
- Additional reasons for using R & RStudio
Part 3: Setting up your data science process
- Example: FiveThirtyEight’s 2019 Presidential debate project
- The Command line: Unix and Windows
- Good enough command-line tools
- Command line recap
Part 4: Keep track of changes with version control
- Tracing our steps
- Git
- Setting up Git
Part 5: RStudio.Cloud
- What have we done so far?
- Recap on RStudio panes
- Where should we write code?
- Rmarkdown
- Documenting our projects
- Import
- Document changes with Git
- Wrangle
- Visualize
- Knitting RMarkdown files
Part 6: Putting your project on Github
- What we’ve done
- Sharing our work with Github pages
- Conclusion
Appendix
- Introduction: A toolchain for making data products
- Chapter 1
- Chapter 2
- Chapter 3
- Chapter 4
- Chapter 5
- Chapter 6