The Big R-Book for Professionals, From Data Science to Learning Machines for the Professional
by Philippe J.S. De Brouwer
– Published by John Wiley & Sons –
This book is written for people who want to build models, apply statistics, use machine learning, manipulate data, present results, and convince other people. It is a story about data, and how to use it successfully in a private company. The book aims to increase your personal brand and your personal value, as well as shareholder value.
Along the way you can learn something about statistical models, multi criteria decision analysis, machine learning and artificial intelligence, and much more … and programming in R. The book takes a pragmatic stance and gets you started. We will even explain how to use big data environments such as Hadoop and Spark, create interactive websites, make efficient presentations, and speed up your code.
Contents of This Page
- The Parts of the Book
- Some Statistics about the Book
- Downloads, Sources, and Sample Material
- Sample Applications that You Learn to Build in the Book
- Get Your Copy
The Parts of the Book
- Introduction: the importance of data science
- Starting with R and Elements of Statistics: the basics
- Explore Data: databases and how to use them
- Data Import: getting data into R
- Data Wrangling: modifying data
- Modelling: building strong models and verifying them, including popular machine learning techniques
- Introduction to Companies: what are companies, financial and management accounting, valuation of financial assets and multi criteria decision analysis
- Reporting: from automated slides to interactive websites
- Bigger and Faster R: using clusters, programming the GPU, big data, and code optimisation
- Appendices: extra code, notes, and levels of measurement
Some Statistics about the Book
- 803 R code examples or segments and 92 other code sections
- 145 illustrations
- 270 boxes with information, warnings, digressions, and further information
- 9 parts, front and back-matter
- 55 chapters, 129 sections, 223 subsections, 128 subsubsections
- 1,100+ pages
- 258,450 words in 34,438 paragraphs
Downloads, Sources, and Sample Material
- Samples (note: these samples are generated from the submission to the editor – so, while the words are the same, they might look different in the book itself):
- Examples of dynamic websites with R:
- An example of a dashboard (with flexdashboard)
- Simple simulator for the normal distribution (with shiny)
The same dashboard published on: rpubs
- storyboard sample
- slides for teaching – soon available
- Would you like to copy-paste the code instead of typing it yourself? Then you want to download a copy of the code in the book.
Sample Applications that You Learn to Build in the Book
This book is about many things, such as databases, big data, models, model validation, etc. Most of these things are better seen and read in the book. Dynamic dashboards and interactive websites, however, are best experienced here on a website. Therefore, we offer you here some of the applications that the book teaches you to make.
The Static Diversity Dashboard
This example is a diversity dashboard for a company or team. The data from the HR team is presented from different angles. Don't forget to select the other tabs in the blue menu bar, view the code with the button on the right, … and even the button to share the dashboard on social media works. Above all, however, it is interesting to see the interaction that is possible with the plots. For example, click [gender], [salary], and then move the cursor over the plot to get the menu bar. You can now zoom in, see values, etc.
Alternatively, you can find the dashboard on rpubs.
Simulation of the Normal Distribution
This simple example takes random numbers from the normal distribution and then plots the histogram. This is useful to experiment and get a feeling for what sample size can do and how the binning of the histogram influences your perception of the distribution.
This is discussed in the book in section 36.1, Shiny.
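The idea behind this simulator can be tried in plain R without any web interface. The following is only a minimal sketch and not the code from the book: the helper name `simulate_normal_hist` and its arguments are assumptions made for illustration. In a Shiny version, the sample size and the number of bins would typically be driven by sliders.

```r
# Hypothetical helper (not taken from the book): draw n values from a
# normal distribution and bin them into `bins` equal-width classes.
simulate_normal_hist <- function(n, bins, mean = 0, sd = 1, seed = NULL) {
  if (!is.null(seed)) set.seed(seed)  # optional, for reproducible draws
  x <- rnorm(n, mean = mean, sd = sd)
  # Equal-width break points that span the observed range of the sample
  breaks <- seq(min(x), max(x), length.out = bins + 1)
  # plot = FALSE returns the histogram object without drawing it
  hist(x, breaks = breaks, plot = FALSE)
}

# Same distribution, different sample size and binning
h_small <- simulate_normal_hist(n = 50,   bins = 5,  seed = 1)
h_large <- simulate_normal_hist(n = 5000, bins = 40, seed = 1)

# Draw both histograms side by side when working interactively
if (interactive()) {
  old_par <- par(mfrow = c(1, 2))
  plot(h_small, main = "n = 50, 5 bins")
  plot(h_large, main = "n = 5000, 40 bins")
  par(old_par)
}
```

Changing `n` and `bins` and redrawing shows how a small sample or a coarse binning can hide the bell shape, while a large sample with finer bins makes it apparent.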