Below is a list of books written with **bookdown**, including those published to bookdown.org (books without substantial content are excluded) and a few hosted on external servers. The books are ordered roughly by date. An asterisk `*` after a date indicates the date is unknown, which often means a `date` field is missing in the YAML metadata of the source document `index.Rmd`. The list of books is automatically generated. For more information (including how to add or remove your books on this page), please see the About page.
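As a hedged illustration of the `date` field mentioned above, the YAML metadata at the top of an `index.Rmd` might look like the sketch below. The title, author, and date values are hypothetical placeholders, and the date format the listing expects is an assumption, since this page does not state it.

```yaml
---
title: "My Example Book"        # hypothetical title
author: "Jane Doe"              # hypothetical author
date: "2020-03-15"              # assumed format; without a date field the listing shows an asterisk
site: bookdown::bookdown_site   # standard bookdown site generator setting
---
```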

# Applet Codebook: NIMH EMA for Mindlogger v0.27

## by Mike X.

This is a codebook that documents all of the items in the current version of the NIMH EMA Applet for Mindlogger. […] This MindLogger applet collects daily information on your physical and mental health. You will be asked a set of questions multiple times a day. We will record the information and share it with you and our researchers so we can look for patterns in the data. Answer these questions to the best of your ability. It is okay if you don’t know the answers to some of them! Thank you for your participation! These questions were constructed as part of a collaboration between the … Read more →

# Applet Codebook: HBN EMA for Mindlogger v0.25

## by Mike X.

This is a codebook that documents all of the items in the current version of the NIMH EMA Applet for Mindlogger. […] This MindLogger applet collects daily information on your physical and mental health. You will be asked a set of questions multiple times a day. We will record the information and share it with you and our researchers so we can look for patterns in the data. Answer these questions to the best of your ability. It is okay if you don’t know the answers to some of them! Thank you for your participation! These questions were constructed as part of a collaboration between the … Read more →

# Notes for Predictive Modeling

## by Eduardo García Portugués

Notes for Predictive Modeling. MSc in Big Data Analytics. Carlos III University of Madrid. […] Welcome to the notes for Predictive Modeling for the course 2019/2020. The subject is part of the MSc in Big Data Analytics from Carlos III University of Madrid. The course is designed to have, roughly, one lesson per main topic in the syllabus. The schedule is tight due to time constraints, which will inevitably make the treatment of certain methods a little superficial compared with what would be optimal. Nevertheless, the course will hopefully give you a respectable panoramic view … Read more →

# Skript zum MAS Modul Servotechnik

## by Martin Pischtschan

Script: Servo Technology […] The author of this material grants all national educational institutions (e.g. schools, chambers of industry and commerce, universities of applied sciences, universities, etc.) a free right of use. This right of use is limited to classroom or lecture use in the context of training, continuing education, and further education. The user is obliged to name the author. The user is entitled to modify the material, provided that they in turn grant all national educational institutions a free right of use for the modified material. A sale by … Read more →

# MGHIHP HE-902, Spring 2020

## by Anshul Kumar

This e-book accompanies the course HE-902 in the PhD in HPEd program at MGHIHP (http://mghihp.edu/phdhped). HE-902 is a statistics course that equips students to analyze healthcare and/or behavioral data in R. […] This online e-book is the main resource to guide you through the course HE-902 in the PhD in HPEd program at MGHIHP in the Spring 2020 semester. Each chapter contains reading (or links to reading) that you should do as well as an assignment that you should complete and submit by the deadline in the course calendar. My name is Anshul Kumar and I am the author/preparer of this … Read more →

# My Data Science Notes

## by Michael Foley

This is a compendium of notes from classes, tutorials, etc. that I reference from time to time. […] These notes are pulled from various classes, tutorials, books, etc. and are intended for my own consumption. If you are finding this on the internet, I hope it is useful to you, but you should know that I am just a student and there’s a good chance whatever you’re reading here is mistaken. In fact, that should probably be your null hypothesis… or your prior. … Read more →

# 社会科学中的统计学

## by 王敏杰

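A simple example of a Chinese book. […] R, as the data science language most worth learning today, is seeing rapidly growing use in the social sciences. This course, 《社会科学中的统计学》 (Statistics in the Social Sciences), introduces the powerful capabilities of R in exploratory data analysis and inferential statistical methods and, drawing on research examples from sociology, psychology, education, linguistics, and other disciplines, discusses the application of advanced statistical methods such as multiple regression, logistic regression, and multilevel models in the social sciences. The course uses Kruschke’s Bayesian data analysis methods and Bürkner’s brms, with tidyverse for data processing and visualization; you can get help here or here, and you can of course also refer to my course materials 《数据科学中的 R 语言》 (R for Data Science). I will keep improving the course materials, so suggestions are welcome. Many thanks to the Graduate School of Sichuan Normal University for its trust, … Read more →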

# Scientific Research Methods: Tutorials

## by Peter K. Dunn

Tutorials for quantitative research in science and health (including research design, hypothesis testing and confidence intervals in common situations) […] This book has been prepared for use with the course SCI110 Science Research Methods¹ at the University of the Sunshine Coast (USC). This book is an introduction to quantitative research methods in the scientific and health disciplines, and introduces the whole research process, from asking a research question to analysis and reporting of the data. The focus, however, is on the analysis of data. ¹ This name is grammatically incorrect. The … Read more →

# 資料科學程式設計（一）

## by 林茂廷

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] E-book with personal annotations: https://via.hypothes.is/https://bookdown.org/tpemartin/ntpu-programming-for-data-science108/ Gitter chatroom: https://gitter.im/ntpuecon/course-108-2-programming-for-data-science Homework: 15%; Midterm 1: 15%; Midterm 2: 25%; Final exam: 35%; In-class GitHub commits: 10%. Sign up for a Gmail account. Sign up for a GitHub account. (Gmail is recommended as the verification email for registration.) Sign up for a Gitter account using your GitHub account. Sign up a … Read more →

# 数据科学中的 R 语言

## by 王敏杰

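A simple example of a Chinese book. […] Hello, this is the course content for 《数据科学中的R语言》 (R for Data Science), a graduate elective at Sichuan Normal University. Since students come from different schools and disciplinary backgrounds, the material will not be too advanced (have confidence!). For example, the following will not appear in the course: $f(x)=\frac{1}{\sqrt{2 \pi}} e^{-\frac{1}{2} x^{2}}$. What appears more often instead is […]. While following this course, I strongly recommend reading Hadley Wickham’s book r4ds (Grolemund and Wickham 2017). The author received the COPSS Presidents’ Award (hailed as the Nobel Prize of statistics) in August 2019; click here to see his photo. 1. The course is arranged so that the content of each chapter is independent; you can read each chapter and run its code on its own. 2. Course source code and data. I will keep improving the course materials, so suggestions are welcome. 3. About the course objectives … Read more →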

# Chinese Chess in R

## by Hochia

A book about playing Chinese chess with orientchessr, an R package. […] Install, for use with this book, the R … Read more →

# Introduction to R - tidyverse

## by Brendan R. E. Ansell @ansellbr3

Introduction to R - tidyverse […] This document contains the material covered in the Introduction to R (tidyverse) course taught at the Walter and Eliza Hall Institute of Medical Research. The course is taught to biomedical scientists, but the material and the teaching examples are very broad. Skills taught in this workshop can be applied to many disciplines in academia and industry. Chapters 1 through 5 make use of popular (non-biological) teaching data sets available through R. Chapters 6 onwards introduce some types of biological data. Our aim with this material is to improve the … Read more →

# Applied Causal Analysis (with R)

## by Paul C. Bauer

Script for the seminar Applied Causal Analysis at the University of Mannheim. […] The present document serves both as slides and script for the MA seminar Applied Causal Analysis. This seminar is currently taught by Paul C. Bauer at the University of Mannheim (Spring 2020). The material was/is being developed by Paul C. Bauer and Denis Cohen and will constitute the basis for a book entitled “Applied Causal Analysis with R” under contract with CRC Press/Chapman & Hall. There will be both a print version and an openly accessible web version. The material is licensed under a … Read more →

# CASA0005 Geographic Information Systems and Science

## by Andy MacLachlan and Adam Dennett

Welcome to the CASA0005 Geographic Information Systems and Science online practical handbook. This website is hosted on GitHub and holds all the practical instructions and data. Data used within the practicals is available online; however, occasionally websites can undergo maintenance or be inaccessible due to political factors such as government shutdowns. If you need the practical data you can access it from my GitHub repository, as explained in Hello GIS. Practical data is divided into the relevant sessions (e.g. prac1_data), although sometimes I’ll refer to a dataset used within a … Read more →

# Property Class Spring 2020

## by Dr. Taleed El-Sabawi

Property Class Spring 2020 […] [IMPORTANT: If you have any TECHNOLOGICAL issues with this website or the assignments, please contact YOUR PROFESSOR NOT Elon IT.] VIEW: This welcome video. READ: Read the course syllabus. DO: Let’s play a game to see how closely you read the syllabus! Click here. Make sure to type your name so I know that you played it! You get credit just for trying. After you have completed the challenge above, give me your thoughts on Kahoots, the gamification software that your game was hosted in. Complete this poll by clicking here. Due Tuesday April 1 by 11 pm. … Read more →

# Data Science for Psychologists

## by Hansjörg Neth

This book provides an introduction to data science that is tailored to the needs of psychologists, but is also suitable for students of the humanities and other biological or social sciences. This audience typically has some knowledge of statistics, but rarely an idea how data is prepared and shaped to allow for statistical testing. By using various data types and working with many examples, we teach tools for transforming, summarizing, and visualizing data. By keeping our eyes open for the perils of misleading representations, the book fosters fundamental skills of data literacy and cultivates reproducible research practices that enable and precede any practical use of statistics. Read more →

# Lab Notebook

## by Naomi J. Goodrich-Hunsaker, Ph.D.

Lab Notebook […] This notebook contains a general record of my experimental work. If additional information or further clarification is needed, email me at naomi.hunsaker@utah.edu. Below is a list of projects with complete analyses: Data access: https://github.com/njhunsak/BRAINSCOPE Data access: https://github.com/njhunsak/CENC Data access: https://github.com/njhunsak/HDFT Data access: https://github.com/njhunsak/KPCA Data access: https://github.com/njhunsak/MAX-IOWA Data access: https://github.com/njhunsak/MAX-MEG Data access: https://github.com/njhunsak/MAX-UCSD Data access: … Read more →

# Notes for Nonparametric Statistics

## by Eduardo García Portugués

Notes for Nonparametric Statistics. MSc in Statistics for Data Science. Carlos III University of Madrid. […] Welcome to the notes for Nonparametric Statistics for the course 2019/2020. The subject is part of the MSc in Statistics for Data Science from Carlos III University of Madrid. The course is designed to have, roughly, one lesson per main topic in the syllabus. The schedule is tight due to time constraints, which will inevitably make the treatment of certain methods a little superficial compared with what would be optimal. Nevertheless, the course will hopefully give you a … Read more →

# Sepages data pre-processing guide

## by M. Rolland

Sepages data pre-processing guide […] In this document we present the different steps used to pre-treat the biomarker data in Sepages. To date this process has been applied to pregnancy urinary levels of phenols and phthalates and cytokine data. This process includes mainly: We will use the example of the Sepages pregnancy phenols data. Here is an overview of the process and the different variables … Read more →

# Orchestrating Single-Cell Analysis with Bioconductor

Online companion to ‘Orchestrating Single-Cell Analysis with Bioconductor’ manuscript by the Bioconductor team. […] This is the website for “Orchestrating Single-Cell Analysis with Bioconductor”, a book that teaches users some common workflows for the analysis of single-cell RNA-seq data (scRNA-seq). This book will teach you how to make use of cutting-edge Bioconductor tools to process, analyze, visualize, and explore scRNA-seq data. Additionally, it serves as an online companion for the manuscript “Orchestrating Single-Cell Analysis with Bioconductor”. While we focus here on scRNA-seq … Read more →

# MGHIHP HE-802, Spring 2020

## by Anshul Kumar

This e-book accompanies the course HE-802 in the MS in HPEd program at MGHIHP (http://mghihp.edu/mshped). HE-802 is a statistics course that equips students to analyze healthcare and/or behavioral data in R. […] This online e-book is the main resource to guide you through the course HE-802 in the MS in HPEd program at MGHIHP in the Spring 2020 semester. Each chapter contains reading (or links to reading) that you should do as well as an assignment that you should complete and submit by the deadline in the course calendar. My name is Anshul Kumar and I am the author/preparer of this e-book. … Read more →

# Literatura

## by Goran Kardum

R in science […] NOTE: The text is a work in progress (it has not been fully proofread and checked, and not all literature citations are linked!) The book is intended for everyone who wants to learn models of data processing and presentation using the R language with the RStudio application. The book is not just a guide to the R language and the RStudio application; it also draws on numerous sources of information and on comparisons of different methods used in the social sciences, humanities, and biomedical sciences. Thus, here we can find comparisons of various explanatory and confirmatory methods with numerous references, as well as modeling (SEM). This work is not … Read more →

# Lab Manual for the Respiratory and Immunology Project and Laboratory Research Team (RIPLRT)

## by Authors: The RIPLRT

This book constitutes the lab manual for the RIPL_Effect Research Team (RIPLRT). The output format for this example is bookdown::gitbook. […] Welcome to the Respiratory and Immunology Project and Laboratory (RIPLRT) at Larkin University College of Biomedical Sciences. If you are reading our lab manual because you recently joined the RIPLRT, welcome! If you are a current member of the RIPLRT, refer to our lab manual frequently to refresh your knowledge of our guidelines, policies, and platforms, among others. We also look forward to immersing you in different learning experiences, such as in immunology, respiratory, … Read more →

# University of Utah

## by Naomi J. Goodrich-Hunsaker, Ph.D.

University of Utah […] This manual contains general descriptions of all neuroimaging pipelines currently being used to analyze MRI scans. If additional information or further clarification is needed, email me at naomi.hunsaker@utah.edu. … Read more →

# Solutions for ‘Introduction to The New Statistics’

## by Peter Baumgartner

This website is a companion book for Introduction to the New Statistics (abbreviated itns). It offers interactive exercises with solutions, and an R tutorial for the end-of-chapter exercises. […] This interactive book is a companion book for Introduction to the New Statistics (abbreviated itns). The two types of additional material it offers are: I have built the interactive exercises in this book with H5P.org. H5P stands for HTML5 Package, a free and open-source content collaboration framework based on JavaScript. With H5P, it is easy for everyone to display, create, share, and reuse … Read more →

# Panduan Menyusun Database Menggunakan Microsoft Access

## by Technaut

This book is a guide used by participants in the Data Science untuk Bisnis (Data Science for Business) class to build a sales database using Microsoft Access. […] This book is supplementary reading for participants in the Data Science untuk Bisnis class. It gives a brief explanation of how to build a database using the Microsoft Access application. The topics presented in this book include … Read more →

# Authentic Data Science

## by Robert Barcik

This is a book that allows anyone to become an Embedded Data Scientist in an intuitive way. […] Data Science is undoubtedly one of the fastest growing areas within IT. This growth is not unjustified, though. The majority of firms have undergone a process of digitalization within the past 2-3 decades. Opportunities within digitalization, such as digital offering of products and digital channels for marketing, are reaching new maturity points. If we look around ourselves, the majority of products that make sense to digitalize indeed have been (if we disregard laggard governments). According to … Read more →

# 3 Chicken Chicken Chicken | Chicken Chicken Chicken: Chicken Chicken

## by Giles Knight

Chicken. […] Chicken chicken chicken “chicken chicken, chicken”. Chicken chicken? Chicken chicken chicken 75 chicken chicken -25% chicken chicken chicken 50 chicken. Chicken chicken chicken $1000 chicken! Chicken chicken chicken chicken 100 chicken chicken 25 chicken (Fig. … Read more →

# Social Advocacy & Ethical Life

## by Yuleng Zeng

This is an introduction to Social Advocacy & Ethical Life (SAEL 200). It is a class I started teaching in Fall 2019, as a member of the Bridge Humanities Corps (BHC) at the University of South Carolina. In compiling this document, I consult a number of online resources. The intention is to record the process of my preparation for this class and help me improve over time. If you see errors, have suggestions, or do not wish your material to be cited here, please do shoot me an email. The syllabus of the class can be found here: … Read more →

# Ciencia de datos para curiosos

## by Martin Montane

A practical introduction to Data Science […] Data science has been present in almost any context one can think of: in the mass media, in our daily experience when we use Netflix or take the subway, and in conversation with colleagues or even family and friends. The main goal of this book is to give an idea of what data science is, what it is for, and how we can use it. For this, only one thing is needed: curiosity. With this desire to learn about what we do not yet know but that catches our attention, the rest of the tools can … Read more →

# Reproducible Miracle

## by Gökmen Altay

This book demonstrates 19-based encodings available in the authentic Quran text using Text Analytics and Probability approaches. […] This book presents 19-based codings that I tested and witnessed in a book fully written in 632, 1387 years before the publication date (2019) of this book. Most of the codings could only be practically realized after the invention of computers. You will not only witness some of the amazing examples of the 19-based codings but also be able to easily test them yourself by running the code I will provide along with each coding example. It means all the … Read more →

# Introduction to Data Science

## by Rafael A. Irizarry

This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression and machine learning and helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, file organization with UNIX/Linux shell, version control with GitHub, and reproducible document preparation with R markdown. Read more →

# Surrogates

## by Robert B. Gramacy

Surrogates: a new graduate level textbook on topics lying at the interface between machine learning, spatial statistics, computer simulation, meta-modeling (i.e., emulation), and design of experiments. Gaussian process emphasis facilitates flexible nonparametric and nonlinear modeling, with applications to uncertainty quantification, sensitivity analysis, calibration of computer models to field data, sequential design and (blackbox) optimization under uncertainty. Presentation targets numerically competent scientists in the engineering, physical, and biological sciences. Treatment includes historical perspective and canonical examples, but primarily concentrates on modern statistical methods, computation and implementation in R at modern scale. Rmarkdown facilitates a fully reproducible tour complete with motivation from, application to, and illustration with, compelling real-data examples. Read more →

# Predictive Analytics for Actuaries

## by Sam Castillo

Predictive Analytics for Actuaries […] Welcome! This is the study guide for the SOA’s Predictive Analytics Exam. While meeting all of the learning requirements of Exam PA, this book gives you data science and machine learning training. You will learn how to get your data into R, clean it, visualize it, and use models to derive business value. Just as a scientist sets up lab experiments to form and test hypotheses, you’ll build models and then test them on holdout sets. The statistics is just the first phase, as you’ll also learn how to explain the results in easy-to-understand, … Read more →

# Elegant Bookdown Template

## by 黄湘云

When I first saw how beautiful the book style produced by elegantbook was, I wanted to bring it into bookdown, so I customized this template. Building on that, I did migration and extension work, combining the strengths of LaTeX (elegant), Pandoc (concise), and R (powerful). This is a bookdown template based on ElegantBook. The output format for this template is bookdown::gitbook and bookdown::pdf_book. […] A Markdown-formatted document should be publishable as-is, as plain text, without looking like it’s been marked up with tags or formatting instructions. (John Gruber) This is an R Markdown document. Markdown … Read more →

# UI-UX

## by 林茂廷老師

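Learn and understand the UI-UX of web design and web services, ultimately combining social science with technical skills and applying what you have learned to develop cross-disciplinary service projects. […] Each chapter is structured as follows: a bulleted list of learning topics; the content for each topic; after-class exercises. For each chapter, a 20-30 minute explanatory video covering the learning topics is recorded. (Because the videos are short, each chapter must be concise, but there can be two after-class exercises: one easy, and one hard that requires further self-study to find the answer.) Use your own computer or the department’s multimedia room. Recording skills are handed down: while one chapter is being recorded, the person responsible for the next chapter assists or observes. Upload to Vimeo. Each Friday is devoted only to discussing the after-class exercises; members (including USR staff) must watch the video and complete the exercises on their own before class. Put your exercise results in your own GitHub practice repo (fork the practice repo from https://github.com/tpemartin/uiux-inclass-practice) … Read more →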

# R Markdown Cookbook

## by Yihui Xie

Examples, tips, and tricks of using R Markdown. […] This book is in a very early stage of development. If you have any suggestions on what should be included within this book, please get in touch via GitHub. R Markdown is a powerful tool for combining analysis and reporting into the same document. Since the development of the rmarkdown package (Allaire et al. 2020), it has grown to become a diverse ecosystem of code, and reports, books and websites can all easily be generated directly from R code. There is a wealth of guidance which has grown over the past few years, and the book R … Read more →

# Statistik Vorlesung

## by Lisa Lechner

These are accompanying notes for the Statistics lecture. […] “Statistics is the grammar of science” (Karl Pearson) The aim of the Statistics lecture is to provide an introduction to the central foundations and concepts of descriptive statistics (data matrix, frequency distributions, measures of location, measures of dispersion, distribution parameters, association between variables) and of inductive statistics. The subject of inductive statistics is to draw conclusions from the sample about the population using suitable methods and to assess the reliability of those conclusions, i.e. probabilities for the … Read more →

# HealthyR: R for health data analysis

## by Ewen Harrison and Riinu Pius

An introductory book for health data analysis using R. […] For draft version This is the electronic version of the HealthyR book to be published by CRC Press/Chapman & Hall in summer 2020. The electronic version will always be freely available. HealthyR resources: healthyr.surgicalinformatics.org Version 0.9.7 This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/us/ … Read more →

# Modern R with the tidyverse

## by Bruno Rodrigues

This book will teach you how to use R to solve your statistical, data science and machine learning problems. Importing data, computing descriptive statistics, running regressions (or more complex machine learning models) and generating reports are some of the topics covered. No previous experience with R is needed. […] This book is still being written. Chapters 1 to 8 are almost ready, but more content is being added (especially to chapter 8). Chapters 9 and 10 are empty for now. Some exercises might be in the wrong place too, and more are coming. You can purchase an ebook version of this book on … Read more →

# Estadística Aplicada con R

## by Oscar González Frutos

Applied Statistics with R […] Go to the website http://cran.r-project.org/, where you will find the installer for Linux, (Mac) OS X, and Windows. Select the desired option and download the corresponding executable. RStudio can be downloaded free of charge from its website: https://rstudio.com/products/rstudio/download/. For a single user, choose the Desktop version. An executable file will be downloaded; accept the default options that RStudio offers. If R is already installed on our machine, RStudio will detect it automatically and we will be able to use it from this environment. … Read more →

# dealing with gin

## by Julien Colomb and Keisuke Sehara

dealing with gin […] Introduction to how to deal with gin (web interface, command-line, and probably WinGIN GUI). This book covers multiple topics; see the table of contents to navigate … Read more →

# IEP Seasonal Monitoring Report: Spring 2018 | Secchi Depth

## by Rosemary_Hartman

This report shows trends in water quality, plankton, and fish across multiple IEP surveys for March through May of 2018. […] This report shows trends in water quality, plankton, and fish across multiple IEP surveys for March through May of 2018. Disclaimer: While substantial efforts are made to ensure the accuracy of these data, complete accuracy of data sets cannot be guaranteed. This report was developed by the IEP Synthesis Team. For questions, comments, or corrections, contact Rosemary Hartman – Rosemary.Hartman@water.ca.gov … Read more →

# Introduction to Quantitative Methods in R

## by Eric van Holm, PhD

This is a textbook written for POLI 2900 at the University of New Orleans. […] This book is written for use in POLI 2900: Methods of Political Research at the University of New Orleans. It was originally written for the Fall of 2019, but will continue to be updated after that class. In this book I cover quantitative research techniques common to the social sciences and attempt to develop students’ skills in programming. In order to practice programming and learn quantitative methods, we will utilize R, a popular programming language for data scientists and researchers. This book is … Read more →

# economic-survey.utf8

## by markbneal

Economic performance of New Zealand dairy farms […] The 2018-19 DairyNZ Economic Survey is the fourteenth annual survey of New Zealand dairy farmers using dairy farm business data from DairyBase®. The Economic Survey of Factory Supply Dairy Farmers was first published in 1963-64 by the New Zealand Dairy Board. In 1988-89 the survey was undertaken by Livestock Improvement Corporation (LIC) and then Dexcel in 1999-00, when the name was changed to Economic Survey of New Zealand Dairy Farmers. From 2005-06 DairyNZ published the survey under the new title DairyNZ Economic Survey. DairyNZ is the … Read more →

# 三國志で学ぶデータ分析 (Japan.R 2019)

## by ill-identified

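“A tutorial on data analysis using R, with the Three Kingdoms (三国志) as its subject” […] This article is material prepared from the manuscript of a presentation given at Japan.R, held on 2019/12/7. This article has two aims. “Data analysis” here means trying to say something with the minimum necessary methods, avoiding overuse of complex, advanced techniques where possible. The “data analysis” this time follows a common approach: data acquisition by scraping, data wrangling and shaping, computing summary statistics, and visualization with graphs. The packages used are all representative R packages used in many settings, such as rvest (scraping), tidyr and dplyr (data wrangling), and ggplot2 (plotting) … Read more →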

# Interpretable Machine Learning

## by Christoph Molnar

Machine learning algorithms usually operate as black boxes and it is unclear how they derived a certain decision. This book is a guide for practitioners to make machine learning decisions interpretable. […] Machine learning has great potential for improving products, processes and research. But computers usually do not explain their predictions which is a barrier to the adoption of machine learning. This book is about making machine learning models and their decisions interpretable. After exploring the concepts of interpretability, you will learn about simple, interpretable models such as … Read more →

# RSCD - JR

## by Gabriel Carrasco

Code Repository - Reactive Serological Case Detection (RSCD) … Read more →

# 醫學統計學

## by 王 超辰 Chaochen Wang

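Study notes from LSHTM […] We are drowning in information and starving for knowledge. (Rutherford D. Roger) I have not yet decided what to write as a preface. I just quietly want to leave behind some notes and reflections. This book was compiled with two R packages, knitr (Xie 2015) and bookdown (Xie 2018). As the countdown to leaving London begins, I have once again leafed through this year’s study notes, the ones I thought over, agonized over, wrestled with, and fretted over, and I am filled with emotion. The London School of Hygiene & Tropical Medicine was, and still is, the school that haunts my dreams; with its historical heritage and its small, exquisite building, every day spent in its corridors was fulfilling and moving. This book is not only the journey of my statistics studies; it also holds every teacher here, every classmate who struggled alongside me, our laughter, and our joys and sorrows. If this book is open on your computer/phone/iPad screen, it means you will be or already are my colleague; if in this life … Read more →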

# A Very Short Course on Time Series Analysis

## by Roger D. Peng

The book covers material taught in the Johns Hopkins Biostatistics Time Series Analysis course. […] This book will cover the use of time series methods in biomedical and public health applications. And maybe rockets? We will use the following … Read more →

# R pour les scientifiques : Mise en œuvre de projets et valorisation des résultats

## by François Rebaudo

A guide to acquiring the basics of programming with R and to efficiently managing and analyzing one’s data. […] I thank all the contributors who helped improve this book with their advice, suggested changes, and corrections (in alphabetical order): The gitbook, html, and epub versions of this book use the open source icons from Font Awesome (https://fontawesome.com). The PDF version uses icons from the Tango project, available from openclipart (https://openclipart.org/). This book was written with the R package bookdown … Read more →

# Handout for Cognitive Diagnosis Modeling

## by Wenchao Ma

This is the handout for the CDM class by Wenchao Ma. […] This online R handout is developed for the Cognitive Diagnosis Modeling class. The major goal of this handout is to illustrate how the GDINA R package can be used for various CDM … Read more →

# CASA Lab Coding for Speech Group

## by Director: Thea Knowles, PhD

This is the record keeping site for the CASA Lab Coding for Speech group. […] The purpose of this group is for members and friends of the CASA Lab to learn basic coding skills that are helpful for speech analysis. The languages/environments we will learn will usually be R and Praat, though we may extend to include other languages over time. Many of the skills we will work on will be directly applicable to projects in the CASA Lab, but the hope is that you will gain skills that you will find personally valuable as well for your own work. We will meet approximately twice a month. Meeting … Read more →

# ¡Manos a la Data!

## by BEST: Behavioral Economics & Data Science Team

An open book that collects each dataset from the project of the same name […] Note: The book is under constant development. It will be updated every week by summarizing the analyses of the weekly databases of the Manos a la Data project. This book was prepared by BEST. A few years ago the term Data Science was not so well known or used by the international community, and even less so locally (Peru). In fact, it was a term rarely used by statisticians and some members of the scientific computing community. Our society has evolved, and with it certain … Read more →

# A Appendix A - SQL code | Cervical Screening - telephone-based recall (2019)

## by Dr. Buddini Ekanayake & Dr. David Fong

A Appendix A - SQL code | Cervical Screening - telephone-based recall (2019) […] Find eligible patients (female, age criteria), who are active patients (defined by three or more contacts in the past two years, including one billed visit in the past six months), who have no cervical screening found in the PapSmear table or the Investigations table. Note that there is another table ‘ObGyn’ which should contain the most recent cervical screening result, but unfortunately, ‘over-chose’ 5 patients (found 295 patients, instead of ‘290’ as found using this search.) Code finds … Read more →

# Introduction to Data Science

## by Ron Sarafian

Class notes for the BGU course - Introduction to Data Science. […] This book accompanies the course I give at Ben-Gurion University, named “Introduction to Data Science”. This is an introductory-level, hands-on course, designed for students with a basic background in statistics and econometrics and no programming experience. It introduces students to the tools needed for building a data science pipeline, including data processing, analysis, visualization and modeling. The course is taught in the R environment. Most of the contents in this book are taken from BGU’s “R” course, … Read more →

# 现代应用统计

## by 黄湘云

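The theory of linear models and its applications, in particular each model's scope of applicability, parameter estimation methods, and model checking and diagnostics, with equal emphasis on theory and algorithmic implementation, accompanied by real case studies. Holding to the principle that all models are linear models, it brings linear models, generalized linear models, generalized additive models, linear mixed-effects models, generalized linear mixed-effects models, and generalized additive mixed-effects models into a single framework. On the applied side, we also consider imbalanced, missing, and anomalous data. Application areas include environmental pollution, epidemiology, and risk control. […] These are only fragmentary personal notes, still a long way from being a book! “Essentially, all models are wrong, but some are useful.” (George Box; Box and Draper 1987) The writing was inspired by Common statistical tests are linear models (or: how to teach stats). References: Modern Applied Statistics with S (4th edition) (Venables and Ripley 2002) and Mixed-effects models in S … Read more →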

# Explanatory Model Analysis

## by Przemyslaw Biecek and Tomasz Burzykowski

This book introduces a unified language for exploration, explanation and examination of predictive machine learning models. […] … Read more →

# Text Mining with R

## by Julia Silge and David Robinson

A guide to text analysis within the tidy data framework, using the tidytext package and other tidy tools […] This is the website for Text Mining with R! Visit the GitHub repository for this site, find the book at O’Reilly, or buy it on Amazon. This work by Julia Silge and David Robinson is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States … Read more →

# PostgreSQL Explained for R-Users and R-Programmers

## by Ben Gonzalez

This is a book that explains PostgreSQL for R programmers and helps explain SQL syntax as well. […] To follow along with this book, you will need the following packages and tools. The RPostgreSQL and DBI packages can be installed from CRAN or GitHub: Notes on using SQL syntax in RPostgreSQL To successfully query data in PostgreSQL, you may need to keep the following caveats in mind. This is especially the case if someone has created column names that are unique and odd in some form or … Read more →

# Methods for Network Analysis

## by Mark Hoffman

Methods for Network Analysis […] This 4-5 credit hour seminar is intended as a theoretical and methodological introduction to social network analysis. Though network analysis is an interdisciplinary endeavor, its roots can be found in classical anthropology and sociology. Network analysis focuses on patterns of relations between actors. Both relations and actors can be defined in many ways, depending on the substantive area of inquiry. For example, network analysis has been used to study the structure of affective links between persons, flows of commodities between organizations, shared … Read more →

# Physical Geology

## by Karla Panchuk, Steven Earle, and contributors (GitHub/bookdown version maintained by Dewey Dunnington)

“Physical Geology”, adapted from Physical Geology: First University of Saskatchewan Edition […] Physical Geology is a comprehensive introductory text on the physical aspects of geology, including rocks and minerals, plate tectonics, earthquakes, volcanoes, mass wasting, climate change, planetary geology and much more. It has a strong emphasis on examples from western Canada. It is adapted from “Physical Geology” written by Steven Earle for the BCcampus Open Textbook Program, and “Physical Geology, First University of Saskatchewan Edition” by Karla Panchuk. The GitHub/bookdown version of … Read more →

# STAT160 R/RStudio Companion

## by Statistics/Data Science at St. John Fisher College

Companion document to Introduction to Statistical Investigations using R/RStudio. […] This companion is for use in STAT160 (Introduction to Data Science). The textbook for the course is Introduction to Statistical Investigations (Tintle et al.). Through in-class and homework assignments, students will learn to use R and RStudio. In this companion, we will review the commands and functions students will need to perform statistical analysis and generate statistical … Read more →

# Statistical Rethinking with brms, ggplot2, and the tidyverse

## by A Solomon Kurz

This project is an attempt to re-express the code in McElreath’s textbook. His models are re-fit in brms, plots are redone with ggplot2, and the general data wrangling code predominantly follows the tidyverse style. […] I love McElreath’s Statistical Rethinking text. It’s the entry-level textbook for applied researchers I spent years looking for. McElreath’s freely-available lectures on the book are really great, too. However, I prefer using Bürkner’s brms package when doing Bayesian regression in R. It’s just spectacular. I also prefer plotting with Wickham’s ggplot2, and coding with … Read more →

# Data Processing & Visualization

## by Michael Clark

The focus of this document is on common data processing and exploration techniques in R, especially as a prelude to visualization. The first part of the document will cover data structures, the dplyr and tidyverse packages, which enhance and facilitate the sorts of operations that typically arise when dealing with data, including faster I/O and grouped operations. For visualization, the focus will be on using ggplot2 and other packages that allow for interactivity. In addition, basic programming concepts and techniques are introduced. Exercises may be found in the document as well. Demonstrations of most of the content in Python are also available via Jupyter notebooks. Read more →

# A guide to the 2017 European Internet Panel Study

## by sveinungarnesen78

This is a guide to the 2017 European Internet Panel Study data set. […] The EIPS is a collaboration between six European probability-based online survey panels. This document gives an overview of the fourth survey, conducted in 2017 (N = 18249). The 2017 joint survey wave was fielded in France by the Étude longitudinale par internet pour les sciences sociales at Sciences Po, in Germany by the German Internet Panel at the University of Mannheim, in Iceland by the Social Science Research Institute Panel (University of Reykjavik), in The Netherlands by the Longitudinal Internet Studies for … Read more →

# EMMERG Spatial

## by Gabriel Carrasco

Working notes […] Quantify the effects of contextual variables by coupling spatial and temporal structures in a Bayesian model framework. The model will then be used to produce out-of-sample predicted probabilities of exceeding county-specific outbreak thresholds to develop early warning information to predict the risk of emerging drug … Read more →

# Introduction to Computational Social Science

## by Mark Hoffman and Tyler McDaniel

Introduction to Computational Social Science […] This seminar is intended as a theoretical and methodological introduction to computational social science. Each week covers substantive and theoretical material and is associated with a technical lab. You will need to bring your laptops to each class. In the technical labs you will learn how to analyze network data in R. This e-book contains all of the technical labs in the order that we cover them. Should you forget anything we learned, you will be able to return to this e-book to cover the material again on your … Read more →

# Eagle I.O Consultant Manual

## by Eagle I.O

This is the student consultant handbook for members of Eagle I.O. The output format for this example is bookdown::gitbook. […] This is the student consultant handbook for members of Eagle I.O. Herein lie expectations, responsibilities, and strategy to keep Eagle I.O sustainable with future Montclair State University I/O Psychology … Read more →

# Data Science with R: A Resource Compendium

## by Martin Monkman

A modest and very incomplete listing of resources for tackling data science problems in R. […] Draft. This book grew out of my ever-growing collection of reference materials, saved as an expanding array of markdown files in a GitHub repo. By assembling it as a book, I hope that it will be more accessible and useful to other R users. The author would like to acknowledge everyone who has contributed to the books, articles, blog posts, and R packages cited within. License: This work by Martin Monkman is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Canada … Read more →

# FPM280B - Practicum

## by Gabriel Carrasco-Escobar

Codebook for FPM280B […] Peru has made great strides in reducing chronic childhood malnutrition. According to the World Bank, people from around the world visit the country to learn its approach to treating the condition. But Peru has been less fortunate in combating another public health problem that’s not as well-known. Anemia is a blood disorder that results from a lack of iron, which is needed to carry oxygen in the blood. And in Peru it affects 2-in-5 children under the age of 3, slowing their development. Anemia is presenting a huge hazard to the future of Peruvian children and their … Read more →

# Neuroimaging Protocols

## by Naomi J. Goodrich-Hunsaker, Ph.D.

Neuroimaging Protocols […] This manual contains general descriptions of all neuroimaging pipelines currently being used to analyze MRI scans. If additional information or further clarification is needed, email me at naomi.hunsaker@utah.edu. … Read more →

# 西山 他『計量経済学』のためのR

## by 北川梨津

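西山 他『計量経済学』のためのR […] This document presents sample R code for reproducing the empirical examples in Yoshihiko Nishiyama, Mototsugu Shintani, Daiji Kawaguchi, and Ryo Okui (2019), 『計量経済学』 (Econometrics), Yuhikaku, and offers hints for the empirical exercises. The datasets needed to reproduce the empirical examples and to work through the empirical exercises are published on Yuhikaku's web support page; please download them from there. The support page also provides sample code for Stata and EViews, so readers who are comfortable with those packages should consult it as well. The reason we generally give only hints for the exercises is that some instructors may assign them as homework … Read more →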

# Data Analysis in R

## by Steve Midway

This is a text that covers the principles and practices of handling and analyzing data. … Read more →

# Automated Content Analysis

## by Kostas Gemenis & Bastiaan Bruinsma

An overview of all the workshops for the course. […] Throughout these seminars we will use R. R is an open-source programme that allows you to carry out a wide variety of statistical tasks. At its core, it is a modification of the programming languages S and Scheme, making it not only flexible but fast as well. R is available for Windows, Linux and OS X and receives regular updates. In its basic version, R uses a simple command line interface. To give it a friendlier look, environments such as RStudio and RCommander are available. Apart from looking better, these environments also provide … Read more →

# Basic R Commands

## by Lecturer: Fábio M. Corrêa, Email: f.correa@ru.ac.za

Basic R Commands […] 21-02-2020 The text is under development and updates are … Read more →

# R for data science: tidyverse and beyond

## by Maxine

R for data science: tidyverse and beyond […] Personal notes on R for Data Science (Wickham and Grolemund 2016), updated as time permits. Suggestions are welcome at https://github.com/enixam/rfordatascience/issues or 565702994@qq.com. tidyverse … Read more →

# The Hitchhiker’s Guide to the tlverse

## by Jeremy Coyle, Nima Hejazi, Ivana Malenica, Rachael Phillips, Alan Hubbard, Mark van der Laan

An open-source and fully-reproducible electronic handbook for applying the targeted learning methodology in practice using the tlverse software ecosystem. […] The Hitchhiker’s Guide to the tlverse, or a Targeted Learning Practitioner’s Handbook is an open-source and fully-reproducible electronic handbook for applying the targeted learning methodology in practice using the tlverse software ecosystem. This work is currently in an early draft phase and is available to facilitate input from the community. To view or contribute to the available content, consider visiting the GitHub repository … Read more →

# Exploration de données avec R

## by Anouar El Ghouch

This document is an introduction to using R, the free software for data processing and statistical analysis. It draws on several sources: This document aims to introduce only the basic concepts that someone discovering the software for the first … Read more →

# 『Rによる原因を推論する』

## by 北川 梨津，原 健人

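Teaching material for the pre-seminar of the Kume seminar. […] Welcome to Ikuo Kume's seminar. Over the next two years you will study the methods of causal inference intensively. In the Kume seminar we mainly use quantitative analysis as the tool for causal inference. Quantitative analysis requires fluency with statistical software that can carry out a wide range of statistical analyses. Following the framework presented in 『原因を推論する』, this book aims to give a practical explanation of elementary quantitative analysis in the R programming language. If you study causal inference and quantitative analysis seriously for two years, your market value should rise dramatically. Let's take that first step firmly. Effort does not always pay off, but until it does … Read more →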

# Using Spark from R for performance with arbitrary code

## by Jozef Hajnala

This bookdown publication attempts to provide practical insights into using the sparklyr interface to gain the benefits of Apache Spark while still retaining the ability to use R code organized in custom-built functions and packages. […] Apache Spark is a popular open-source analytics engine for big data processing and thanks to the sparklyr and SparkR packages, the power of Spark is also available to R users. This short publication attempts to provide practical insights into using the sparklyr interface to gain the benefits of Apache Spark while still retaining the ability to use R code … Read more →

# VDOT_GIS

## by Gabriel Carrasco

Code repository for VDOT Analysis […] Determine mobility patterns of VDOT … Read more →

# Statistical Tools for Causal Inference

## by The SKY Community

This is an open source collaborative book. […] … Read more →

# Прикладна аналітика для активістів природоохоронного руху

## by Василенко Євген

This is a practical example of using the R programming language for environmental protection activism. All examples are based on practical cases of the Public Association «Ecological Council of Kryvorizhzhya» (Ukraine) […] This book is a visual guide for activists in the environmental protection movement. It contains practical advice that is complex but very important right now: how to convey the problems of the extreme pollution of Ukraine's environment to the general public; how to show people that nothing in nature arises “just like that”; how to show people the impact of dozens of invisible-to-the-naked-eye … Read more →

# Doing Bayesian Data Analysis in brms and the tidyverse

## by A Solomon Kurz

This project is an attempt to re-express the code in Kruschke’s (2015) textbook. His models are re-fit in brms, plots are redone with ggplot2, and the general data wrangling code predominantly follows the tidyverse style. […] Kruschke began his text with “This book explains how to actually do Bayesian data analysis, by real people (like you), for realistic data (like yours).” In the same way, this project is designed to help those real people do Bayesian data analysis. My contribution is converting Kruschke’s JAGS and Stan code for use in Bürkner’s brms package, which makes it easier to fit … Read more →

# rdwd

## by Berry Boessenkool, berry-b@gmx.de

rdwd is an R package to handle data from the German Weather Service (DWD). This vignette has 3 main sections: Important links: The remainder of this intro chapter is a copy of the github README file. rdwd is an R package to select, download and read climate data from the German Weather Service (Deutscher Wetterdienst, DWD). The DWD provides thousands of datasets with weather observations online at opendata.dwd.de. Since May 2019, rdwd also supports reading the Radolan (binary) raster data at grids_germany. rdwd is available on CRAN: It has been presented at FOSDEM 2017 and UseR!2017 in … Read more →

# Geocomputation with R

## by Robin Lovelace, Jakub Nowosad, Jannes Muenchow

Geocomputation with R is for people who want to analyze, visualize and model geographic data with open source software. It is based on R, a statistical programming language that has powerful data processing, visualization, and geospatial capabilities. The book equips you with the knowledge and skills to tackle a wide range of issues manifested in geographic data, including those with scientific, societal, and environmental implications. This book will interest people from many backgrounds, especially Geographic Information Systems (GIS) users interested in applying their domain-specific knowledge in a powerful open source language for data science, and R users interested in extending their skills to handle spatial data. Read more →

# Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Educational Expansion

## by Melissa K Sharp

This site is a public, open-source repository for epidemiological research methods and reporting skills for observational studies. We aim to be as inclusive as possible but this site is based on the Strengthening the Reporting of Observational studies in Epidemiology Statement. […] The purpose of this site is to create a public, open-source repository for epidemiological research methods and reporting skills for observational studies. Epidemiology, the study of diseases and population health, is a broad field with ever-changing methods and often heated debates about proper designs, … Read more →

# Predict Crypto Database Quick Start Guide

## by Ricky Esclapon - riccardo.esclapon@colorado.edu

This is a quick start guide for the Predict Crypto database which should provide the support you need to interact with the database and pull data as you would like. […] This is a quick start guide for the Predict Crypto DataBase which should provide the support you need to interact with the database and pull data. Everything you need to know will be outlined in this document and you can use the sidebar on the left (s is the hotkey to show/hide it) to review the following sections: Overview- This section. Interacting with the DB- Instructions around accessing the Metabase environment that … Read more →

# The dymiumCore package

## by Amarin Siripanich

This is a documentation of the dymiumCore package. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). The bookdown package can be installed from CRAN or Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need XeLaTeX. You are recommended to install TinyTeX (which includes XeLaTeX): https://yihui.org/tinytex/. … Read more →

# RMarkdown for Scientists

## by Nicholas Tierney

A book created for a 3-hour workshop on rmarkdown […] This is a book on rmarkdown, aimed at scientists. It was initially developed as a 3-hour workshop, but has since become a resource that will grow and change over time as a living book. This book aims to teach the following: There are many great books on R Markdown and its various features, such as “Rmarkdown: The definitive guide”, “bookdown: Authoring Books and Technical Documents with R Markdown”, and “Dynamic Documents with R and knitr, Second edition”, and Yihui Xie’s thesis, “Dynamic Graphics and Reporting for Statistics”. So … Read more →

# Alfredo’s Laboratory Notebook

## by Alfredo Enrique Gonzalez

This is the virtual laboratory notebook of Alfredo E. Gonzalez and his research and work in the Boutros Lab. […] This is the virtual laboratory notebook containing the work of Alfredo E. Gonzalez for the Boutros Lab. The first two chapters (Module 1 and Module 2) are dedicated to the two folders of the Boutros Lab introductory bioinformatics training in the R programming language. Within both of these chapters there are subsections for each question of the problem sets. As additional analyses are generated in real time, they will be appended to the subsequent chapters. Additional notes (# … Read more →

# Boutros Laboratory Notebook

## by Alfredo Enrique Gonzalez

This is the virtual laboratory notebook of Alfredo E. Gonzalez for his research and work in the Boutros Lab […] … Read more →

# Alfredo’s Laboratory Notebook

## by Alfredo E. Gonzalez

Alfredo’s Laboratory Notebook with answers to R training modules … Read more →

# EDP Sun Power prediction Challenge

## by Sergio Berdiales

EDP Sun Power prediction Challenge […] In this notebook I document the process of building the models with which I will try to break into the participant ranking of the machine learning challenge “Sun Power Prediction”, which EDP has posted on its open data website and which I quote below (date: 2019-11-14). “The objective of this competition is to build an algorithm that predicts the production of solar module B (with optimal orientation) for the first seven days of 2018. For this, you can rely on the weather station data for these days.” So far there are only 7 participants, … Read more →

# Matts Baking Cook Book

## by birderboone

Matts Baking Cook Book … Read more →

# rOpenSci Packages: Development, Maintenance, and Peer Review

## by rOpenSci software review editorial team: Brooke Anderson, Scott Chamberlain, Anna Krystalli, Lincoln Mullen, Karthik Ram, Noam Ross, Maëlle Salmon, Melina Vidoni

Extended version of the rOpenSci packaging guide. This book is a guide for authors, maintainers, reviewers and editors of rOpenSci. The first section of the book contains our guidelines for creating and testing R packages. The second section is dedicated to rOpenSci’s software peer review process: what it is, our policies, and specific guides for authors, editors and reviewers throughout the process. The third and last section features our best practice for nurturing your package once it has been onboarded: how to collaborate with other developers, how to document releases, how to promote your package and how to leverage GitHub as a development platform. The third section also features a chapter for anyone wishing to start contributing to rOpenSci packages. Read more →

# Marrow’s Compendium of Dragonslaying

## by Marrow, Heartseeker-US

This is a Fury Warrior Guide for World of Warcraft Classic. […] This is a guide on how to play a Fury Warrior in World of Warcraft Classic. It is a work in progress and a living document. All of the information contained within reflects what is best understood as of today, and some of it is subject to change as more about the game is discovered. More importantly, this is a guide for players who want to push the envelope of their class, and be the best they can be. That is not the playstyle of every player, nor am I advocating it should be. Ultimately, you should pick your race and spec so that … Read more →

# Crypto Research Paper Tutorial Outline

## by Riccardo Esclapon

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] We could probably use a section at the start to get all packages installed. The code below can be used to install the PredictCrypto package in R: We will also be using the tidyverse (Wickham 2019), so let’s import that as well: The paper will have to be a static file because that’s how a research paper would work obviously, so there will be two main ways to follow along with the tutorial: Reproducible examples are extremely important to have in the research … Read more →

# blogdown: Creating Websites with R Markdown

## by Yihui Xie, Amber Thomas, Alison Presmanes Hill

A guide to creating websites with R Markdown and the R package blogdown. […] In the summer of 2012, I did my internship at AT&T Labs Research, where I attended a talk given by Carlos Scheidegger (https://cscheid.net), and Carlos said something along the lines of “if you don’t have a website nowadays, you don’t exist.” Later I paraphrased it as: “I web, therefore I am a spiderman.” Carlos’s words resonated very well with me, although they were a little exaggerated. A well-designed and maintained website can be extremely helpful for other people to know you, and you do not need to wait for … Read more →

# Nighttime Lights and Malaria in the Peruvian Amazon

## by Gabriel Carrasco-Escobar

Code library of data exploration and analysis […] Nighttime Lights (NL) datasets have been used in urban planning, econometric downscaling, disaster response, development, and socio-environmental studies since the very beginning of their public release (Pastor-Escuredo, Savy, & Luengo-Oroz, 2015; Henderson, Storeygard, & Weil, 2012; Otchia & Asongu, 2019). In the malaria literature, most studies have used data extracted from NL datasets as a covariate to understand malaria epidemiology patterns (Lechthaler et al., 2019), most commonly as part of urbanicity/landscape indexes (Keiser et … Read more →

# 数理统计讲义

## by 何志坚

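数理统计讲义 […] These lecture notes accompany the course Mathematical Statistics and mostly cover material presented in class. Chapter 1 introduces the basic concepts of mathematical statistics. Chapter 2 covers point estimation, including the method of moments and maximum likelihood estimation, as well as interval estimation. Chapter 3 introduces the basic concepts of hypothesis testing and the likelihood ratio test. Chapter 4 covers simple and multiple linear regression models, least squares estimation, and the corresponding interval estimates and significance tests. Dedicated to my wife and son. These notes are intended only for students taking the Mathematical Statistics course; for any other use, please contact the author: hezhijian@scut.edu.cn … Read more →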

# Statistics Cookbook

## by Kirthana S

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Here are a few recipes based on the type of data that one has. Procedure: Step 1: Preprocessing the data by checking for NA values and scaling or centering the data. Step 2: Inspecting the data for potential patterns or trends. Step 3: Performing the analysis. Step 4: Interpreting the results. Step 5: Using inference tests to check the stability and reliability of the results (e.g. permutation, bootstrap, jackknife). Step 6: Concluding the … Read more →
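
As an illustration of the stability check named in Step 5, here is a minimal sketch of a percentile bootstrap interval for a sample mean. It is not taken from the book: the function name `bootstrap_ci`, its parameters, and the sample data are assumptions for the example, and it uses only the Python standard library.

```python
import random
import statistics


def bootstrap_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the mean (illustrative sketch)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(values)
    # Resample with replacement and record the mean of each resample.
    means = sorted(
        statistics.fmean(rng.choices(values, k=n)) for _ in range(n_resamples)
    )
    # Pick the empirical alpha/2 and 1 - alpha/2 quantiles of the means.
    lo_idx = max(0, round((alpha / 2) * n_resamples))
    hi_idx = min(n_resamples - 1, round((1 - alpha / 2) * n_resamples) - 1)
    return means[lo_idx], means[hi_idx]


# Hypothetical sample; a narrow interval suggests a stable estimate.
sample = [2.1, 2.5, 3.0, 3.2, 3.9, 4.4, 5.0]
low, high = bootstrap_ci(sample)
print(f"mean={statistics.fmean(sample):.2f}, 95% interval=({low:.2f}, {high:.2f})")
```

Permutation tests and the jackknife follow the same resample-and-recompute pattern, differing only in how the resamples are drawn.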

# Modélisation de la transmission du paludisme à échelle micro-géographique

## by Paul Taconet

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). The bookdown package can be installed from CRAN or Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need XeLaTeX. You are recommended to install TinyTeX (which includes XeLaTeX): https://yihui.name/tinytex/. … Read more →

# Doing Meta-Analysis in R

## by Mathias Harrer, M.Sc.¹, Prof. Dr. Pim Cuijpers², Prof. Dr. Toshi A. Furukawa³, Assoc. Prof. Dr. David D. Ebert²

This is a guide on how to conduct Meta-Analyses in R. […] This guide shows you how to conduct meta-analyses in R from scratch. The focus of this guide is primarily on clinical outcome research in psychology. It was designed for staff and collaborators of the Protect Lab, which is headed by Prof. Dr. David D. Ebert. Although this guide will provide some information on the statistics behind meta-analysis, it will not give you an in-depth introduction into how meta-analyses are calculated statistically. It is also beyond the scope of this guide to advise in detail which meta-analytical … Read more →

# FMPH 291 - Phase 1

## by Ruohui(Matt) Chen, Gabriel Carrasco

Codebook of FMPH 291 - Phase 1 […] … Read more →

# Project ideas for Applied Population and Statistical Ecology

## by Trevor Hefley

Project ideas for Applied Population and Statistical Ecology […] This website contains six sources of data that may be used for class projects. If you are interested in a project that uses data that is not presented here, please discuss this with Elsie, Katy or Trevor. It is likely that we can find a data set for your research … Read more →

# 經濟模型程式設計

## by 國立臺北大學 經濟學系 林茂廷老師

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). The bookdown package can be installed from CRAN or Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need XeLaTeX. You are recommended to install TinyTeX (which includes XeLaTeX): https://yihui.org/tinytex/. … Read more →

# Hands-On Machine Learning with R

## by Bradley Boehmke & Brandon Greenwell

A Machine Learning Algorithmic Deep Dive Using R. […] This book is sold by Taylor & Francis Group, who owns the copyright. The physical copies are available at Taylor & Francis and Amazon. Welcome to Hands-On Machine Learning with R. This book provides hands-on modules for many of the most common machine learning methods to include: You will learn how to build and tune these various models with R packages that have been tested and approved due to their ability to scale well. However, our motivation in almost every case is to describe the techniques in a way that helps develop intuition for … Read more →

# The MumfordBrainStats Mixed Models Series: Companion for the YouTube series

## by Jeanette Mumford

The MumfordBrainStats Mixed Models Series: Companion for the YouTube series […] This is a collection of materials that accompanies a YouTube series on the MumfordBrainStats channel about mixed models. Although I normally focus on material related to neuroimaging, this is for a general audience. Each of these chapters should be understandable without watching the video, but one would probably gain the most by watching the videos as well. The chapter titles indicate which video in the series goes along with that chapter. Not all videos have a chapter (yet), since I’m only including chapters with … Read more →

# Geostatystyka w R

## by Jakub Nowosad

Introduction to geostatistics with R (in Polish). These course notes aim to introduce spatial analysis (GIS) using the R programming language, and then to apply that knowledge to geostatistical estimation (interpolation) and simulation. […] The notes are currently being updated for the third edition, planned for February 2021. PDF versions of the first and second editions can be found at https://nowosad.github.io/publications/. You have before you a set of notes containing materials for geostatistics exercises. It consists of a dozen or so chapters showing how … Read more →

# Modelagem de coortes com dados administrativos

## by Felipe Ferré

Modelagem de coortes com dados administrativos […] Public administration makes available a growing volume of open data, whose equally open use strengthens democracy. A large part of this data universe is handled by Health Information Technology. The Ministry of Health provides more than 10 billion records that can be used in ecological studies and more than 250 million individualized medication records, that is, records with an identifier that makes it possible to link records of the same individual across different databases. Another 3 billion records from the … Read more →

# Doğrusal (Lineer) Regresyon

## by Uğur Dar

Doğrusal (Lineer) Regresyon […] Linear regression is a method frequently used in statistical data analysis. It is used for linear, continuous variables. In machine learning, which has become popular in recent years, it is the introductory topic in many sources. A good grasp of this topic is therefore essential for anyone who wants to develop their machine learning skills. This document focuses on applying linear regression in R rather than on its theory. Below, with two examples, where and how we can use the linear regression model … Read more →

# Stupid Machine

## by Mark Niemann-Ross

A murder mystery solved by a refrigerator. […] Stupid Machine is a Hard Science Fiction novel set in 2062. Car accidents don’t happen. The last one was twenty-some years ago, somewhere around 2040. Which makes Jordan Bishop’s fatal crash in a self-driving vehicle unusual. Maybe even a murder. Jupyter Fuertes works with appliances—rice cookers, ovens, whatever calls for help—coaching them back to proper operation. She hopes for something bigger. She’s hounded by a refrigerator with an impossible question. Araci Belo doesn’t know cars. He was a proud detective of the police force, but now he … Read more →

# R Programlama - Başlangıç

## by Uğur Dar

R Programming - Getting Started […] In R, the “=” or “<-” operators are used for assignment. When assigning to an object, the “<-” operator is the one commonly used. Inside functions, the “=” operator is used. When a previously created object is assigned to again, it takes the most recently assigned value and the previous value is deleted. When we call the object, we can see its value. Quotation marks " " are used when assigning characters and strings. The class() function can be used to see an object’s class. One frequently used class is logical. To delete from the “Global Environment” … Read more →

# 1 Çubuk Grafiği (Bar Graph)

## by Uğur Dar

# Panduan Lengkap Analisis Statistika Menggunakan R Commander

## by Mohammad Rosidi

A book offering statistics tutorials using R-Commander, a graphical user interface (GUI) for performing analyses and building statistical models. […] … Read more →

# Pursuing Truth: A Guide to Critical Thinking

## by Randy Ridenour

This is a textbook for use in undergraduate critical thinking courses. […] This is a textbook written primarily for my students in PHIL 1502: Critical Thinking, at Oklahoma Baptist University in Shawnee, Oklahoma. There are many good textbooks for critical thinking on the market today, so why write another one? First, all of those books were written for someone else’s course. None cover all of the topics that I would like to cover in class. Second, as I’m sure any student can attest to, textbooks are remarkably expensive, to the point that most of the world’s population cannot afford access … Read more →

# Programación - Desarrollo de Aplicaciones Web

## by Sergio Berdiales

Programming - Web Application Development […] This document contains my personal notes and practical exercises for the “Programming” subject in the first year of the higher vocational training (FPII) module in Web Application Development. I am taking this module at the Centro Integrado de Formación Profesional de la Universidad Laboral de Gijón (2019-2020 academic year). Besides doing the exercises in the languages required during the course, I will also try to replicate them in the languages I usually use in my professional work, R and Python. The original content on which I base my … Read more →

# Module Initiation

## by Claire Della Vedova, for Data Value

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] For this training, you need to install recent versions of R (3.6.1 or later) and RStudio (version 1.2.5001 or … Read more →

# Some Notes on Mathematics

## by Yifei Xiong

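This is a note using the bookdown package to write a book. The output format for this is bookdown::gitbook. […] About this site: this site will be updated from time to time with notes, including but not limited to probability theory and mathematical statistics, stochastic processes, analysis of algorithms, foundations of algebra, and some computer-related topics. When time permits, I write in R Markdown, which produces very tidy notes. When time is short, the notes are shown only as GoodNotes exports (usually images or PDF files). Content updated so far: Semester ?: stochastic integration; Semester 3: ordinary differential equations notes; Semester 3: abstract algebra notes; Semester 4: complex analysis notes; Semester 4: probability theory notes; Semester 4: numerical analysis notes; Semester 5: mathematical statistics notes; Semester 5: real analysis notes; Semester 5: differential geometry notes. Author … Read more →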

# Quantitative Research Methods for Political Science, Public Policy and Public Administration for Undergraduates: 1st Edition With Applications in R

## by Wesley Wehde, Hank Jenkins-Smith, Joseph Ripberger, Gary Copeland, Matthew Nowlin, Tyler Hughes, Aaron Fister, and Josie Davis

Quantitative Research Methods for Political Science, Public Policy and Public Administration for Undergraduates: 1st Edition With Applications in R […] The idea for the graduate level version of this book grew over decades of teaching introductory and intermediate quantitative methods classes for graduate students in Political Science and Public Policy at the University of Oklahoma, Texas A&M, and the University of New Mexico. Despite adopting (and then discarding) a wide range of textbooks, we were frustrated with inconsistent terminology, misaligned emphases, mismatched examples and data, … Read more →

# FMPH291 Assignment 1

## by Gabriel Carrasco Escobar

Using the NHEFS data set answer the following etiological question: Is daily physical activity associated with cholesterol level? […] Using the NHEFS data set answer the following etiological question: Is daily physical activity associated with cholesterol level? Justify your approach: For next week, prepare 2-3 slides with the justification of your approach and the main estimate(s) of interest of your choice by Tuesday, Jan 14 at … Read more →

# Crime by the Numbers

## by Jacob Kaplan

This book introduces the programming language R and is meant for undergrads or graduate students studying criminology. R is a programming language that is well-suited to the type of work frequently done in criminology - taking messy data and turning it into useful information. While R is a useful tool for many fields of study, this book focuses on the skills criminologists should know and uses crime data for the example data sets. Read more →

# Climate-related Risk to European Fish and Fisheries

## by Mark R. Payne, John Pinnegar, Manja Kudahl, Georg Engelhardt

This is a working draft of the CERES (https://ceresproject.eu/) vulnerability analysis (Task 5.3) looking at the climate risk of fish and fisheries in Europe. […] This document is a working draft of the CERES (https://ceresproject.eu/) vulnerability analysis (Task 5.3) looking at the climate-related risk to fish and fisheries in Europe. The source code for both this manuscript and the associated analysis is stored under version control on GitHub at https://github.com/markpayneatwork/CERES_vulnerability … Read more →

# Github actions with R

## by Chris Brown, Murray Cadzow, Paula A Martinez, Rhydwyn McGuire, David Neuzerling, David Wilkinson, Saras Windecker

An introduction to using GitHub Actions with R. […] GitHub Actions allow us to trigger automated steps after GitHub interactions such as when we push, pull, submit a pull request, or write an issue. For example, there are actions that will automatically trigger: GitHub Actions follow the steps designated in a YAML file, which we place in the .github/workflows folder of the repo. We can add these YAML files to our repo either by clicking through a series of steps on GitHub.com, or by using wrapper functions provided by the usethis package, depending on which actions you wish to … Read more →

# Measuring what Matters: Introduction to Rasch Analysis

## by Exercise and data: Anthony Clairmont; Assistance, web: Daniel Katz; Direction: Mike Wilton

This book is meant to be used as a guide for DBER participants in the Rasch workshop […] This is meant to be a very general introduction for using the Rasch model to help construct measures and surveys in discipline based education research. This is meant to get you started but is by no means where you should stop. Please see the references section for where to go next. Please see, Wilson (2005) and Bond and Fox (2015) for more. Bond, Trevor G, and Christine Fox. 2015. Applying the Rasch Model : Fundamental Measurement in the Human Sciences. Mahwah, N.J.: L. Erlbaum. Wilson, Mark. 2005. … Read more →

# 經濟數學程式設計專題

## by 國立臺北大學 林茂廷老師

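This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Assignments 70%, final project 30%. You will learn the following skills and knowledge: basic Python syntax; the mathematical problems that arise in economic and statistical models (including machine learning) and the concepts behind solving them; numerical methods for solving optimization problems under certainty and uncertainty. The course has no programming prerequisite, but basic familiarity (with either R or Python) is recommended. Basic math concepts: derivatives, integrals, gradients, etc. Optimization under certainty: unconstrained and constrained. Basic statistics concepts: random variables, estimators, sampling distributions, etc. Optimization under uncertainty: unconstrained and constrained. 1. Install Python via Anaconda: go to https://www.anaconda.com/, click Download, and download the version for your system. (Please install Python 3.X, where X is a digit … Read more →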

# Comparative Methods

## by Brian O’Meara

How to do comparative methods for evolution and ecology […] This book was created as part of my PhyloMeth class, which focuses on sensibly using and developing comparative methods. It will be actively developed over the course of Spring 2017, so if you don’t like this version (see date above), check back soon! The book is available here but you can fork it, add issues, and look at raw source code at https://github.com/bomeara/ComparativeMethodsInR. [Note I’ll be changing the name of the repo eventually; the course is largely in R (not entirely) but of course many key methods appear in other … Read more →

# High School Math Competition

## by Math Down

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] A circle \(O\) has radius 10; \(AB\) and \(CD\) are two chords of the circle. \(AB \parallel CD\), \(AB = 12\), \(CD = 16\). What is the distance between \(AB\) and \(CD\)? In triangle \(ABC\), \(AB = AC\) and \(\angle ABC = 30^\circ\). \(BD\) is a diameter of \(\triangle ABC\)’s circumscribed circle, and \(CD = \frac{4\sqrt{3}}{3}\). What is \(AD\)? \(AB\) is a chord of circle \(O\). \(E\) is the midpoint of \(AB\). \(EC \perp OA\), and \(BD\) is a tangent line of circle \(O\). \(AB = 12\), \(BD = 5\). What is the radius … Read more →

# An Introduction to Acceptance Sampling and SPC with R

## by John Lawson

The output format for this book is bookdown::gitbook. […] This e-book was originally written for Stat 462 (Quality Control) (see Description) taught in the Statistics Department at Brigham Young University. It is free to read online here, and is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (http://creativecommons.org/licenses/by-nc-sa/4.0/). One of the objectives of Stat 462 is to prepare students to pass the ASQ Certified Quality Process Analyst Exam. The book The Certified Quality Process Analyst Handbook by (Christensen, Betz, and Stein … Read more →

# Sabeis - Sala Aberta de Inteligência em Saúde

## by Felipe Ferré

SABEIS - PCDT wiki. […] Welcome to the SABEIS - PCDT tutorial. Use the side menu to … Read more →

# ExamPA.net Overview

## by exampatutor

ExamPA.net Overview […] The video solution to the June 2019 Exam PA. Here you can see the project statement for a practice exam which is in the format of a real exam. Cheat Sheet Sample Sign up Sam Castillo is a predictive modeler at Milliman in Chicago, maintains a blog about the future of risk, and won the 2019 SOA Predictive Analytics and Futurism’s Jupyter contest. Contact: … Read more →

# Meta-Workflow

## by Miao YU

This is a workflow for metabolomics studies. […] This is an online handout for mass spectrometry based metabolomics data analysis. It covers a full reproducible metabolomics workflow for data analysis and important topics related to metabolomics. Here is a list of topics: This is a book written in bookdown. You can contribute to it via a pull request on GitHub. R and RStudio are the software needed in this … Read more →

# jamoviguiden

## by Jonas Rafi

Learn how to do independent samples t-test, paired samples t-test, one sample t-test, ANOVA, repeated measures ANOVA, factorial ANOVA, mixed ANOVA, linear regression, and logistic regression in jamovi. jamoviguiden also includes sections on csv files and levels of measurement. […] The purpose of jamoviguiden is to provide quick-start guides to common procedures in jamovi. If you are looking for a basic introduction to both statistics and jamovi, I recommend the free book Learning statistics with jamovi by Danielle J. Navarro and David R. Foxcroft. jonas.rafi psychology.su.se. This work … Read more →

# The jamovi quickstart guide

## by Jonas Rafi

The jamovi quickstart guide features a collection of non-technical tutorials on how to conduct common operations in jamovi. This includes how to conduct independent samples t-test, paired samples t-test, one sample t-test, ANOVA, repeated measures ANOVA, factorial ANOVA, mixed ANOVA, linear regression, and logistic regression. Additionally, the tutorials cover the use of csv files, wide data format, and setting the data type in jamovi. Read more →

# 数据分析残卷

## by 于淼

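A panoramic scroll of the data analysis world, in the manner of Along the River During the Qingming Festival. […] Learning always begins with imitation, and imitation means accepting without discrimination. If the goal is merely to earn a living, that is enough; but with enough imitation one begins to see what lies beneath the surface of knowledge. If one can then distill it into one’s own experience, and let those experiences connect and influence one another, a theory emerges. So learning mostly starts from individual points of knowledge and ends in forming a view of one’s own; if that view wins the recognition of others, knowledge begins to be passed on. However many things get rediscovered and reinvented, the fact that the world is to some extent knowable is a boon to the survival of intelligent creatures. As for data analysis, I have taught myself from many textbooks. The methods of the different schools differ greatly, yet I still harbor the delusion of unifying them, hence these notes to bring them together. Whatever its eventual degree of completion, this can only ever be a fragmentary volume, because the world always holds unknowns, and therefore always holds hope. … Read more →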

# 现代科研指北

## by 于淼

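With little talent and shallow learning, I do not know what is true and only know ways of erring less, so I dare not call this a guide (指南, literally “pointing south”) and merely point north. Or, put differently: a guide to digging and jumping into the pits of modern research … Read more →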

# 自然生活的数学原理

## by 于淼

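The Mathematical Principles of Natural Life, also known as Neo-Pythagoreanism, is a pamphlet published in 2222. It was endorsed by the editors of The Hitchhiker’s Guide to the Galaxy for inclusion in the appendix, but lost the chance when the editor, having made the acceptance decision in the toilet that day, discovered there was no paper. It currently exists in a Schrödinger’s-cat state in the author’s mind, and aims to use the simplest mathematical principles for everyday decisions, continually raising or lowering one’s happiness in life. The primate humans living on Earth are a strange kind of intelligent animal, whose evolutionary after-effects include, but are not limited to, having no mating season (or, put another way, being in heat at every moment after sexual maturity), a frequently malfunctioning mechanism for venting bodily gas, and a consciousness that offers illusory explanations of its control over behavior. … Read more →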

# Writing Frictionless R Package Wrappers

## by Bob Rudis

Extending the functionality of R via R’s foreign language interfaces. … Read more →

# 基于R语言的科研信息分析与服务

## by 王敏杰

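Scientific Research information service using R […] After giving a series of R lectures at the library for a year or so, I had the idea of writing a book using R, partly to provide students with worked examples for learning R and partly to offer some research information services to researchers at our university. If this helps teaching and learning reinforce each other, and puts data science into better practice, it will have been a worthwhile attempt; unfortunately my time and energy are limited and the writing is progressing slowly. The code in this book is public, so you can reproduce every step. The code and datasets used in this book … Read more →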

# EESA01 Laboratory Manual: Introduction to Environmental Science

## by Andrew Apostoli and Adam Martin

A lab manual for students of Environmental Science […] This book is an open source document. The book is built using the bookdown package (Xie 2019) in R, and pandoc. We cordially invite you to provide us with any feedback or comments that you may have by sending an email to andrew.apostoli@mail.utoronto.ca or adam.martin@utoronto.ca. Please note that any contributions must be licensed under the Creative Commons Attribution-ShareAlike 4.0 International License. For more information, see “License” (1.2) below. This project is coordinated by Andrew Apostoli, and Adam Martin. This book is the … Read more →

# edav.info/

## by Zach Bogart, Joyce Robbins

This resource is a collaborative collection of resources designed to help students succeed in GR5702 Exploratory Data Analysis and Visualization, a course offered at Columbia University. While the course lectures and textbook focus on theoretical issues, this resource, in contrast, provides coding tips and examples to assist students as they create their own analyses and visualizations. It is our hope that students will contribute to edav.info and it will grow with the course. Read more →

# Manipulación de datos e investigación reproducible en R

## by Derek Corcoran

This book is a companion to the course Data Analysis and Manipulation in R […] To get started you need the latest version of R and RStudio (R Core Team 2019). The packages pacman, rmarkdown, tidyverse and tinytex are also required. If you have not used R or RStudio before, the video at the following link shows how to install both programs and the packages needed for this course. The code for installing those packages is as follows: If you need help with the installation, contact the course instructor. If you have never worked with R … Read more →

# «Shri Jobim»

## by Dmitry Gorodnichy

Antonio Carlos Jobim songs re-interpreted by Dmitry Gorodnichy […] This album contains my interpretations of my favourite Jobim bossa nova songs in Russian and Ukrainian. These songs have been dear to my heart for many years. They were much loved by my parents, who could also play some of them on piano. Following the passing of my parents (in 2003 and then in 2016), I started feeling them on a new, deeper level. The result is this album. The songs are divided into two groups: those about Love (“Grand amor”) and those about the meaning and beauty of Life (“Ocean”). The album also contains … Read more →

# Lecture Notes for Project Management

## by B. Depaire

These are the lecture notes for the course Project Management […] This document contains the lecture notes for the course Project Management (3897) taught at Hasselt University. Each chapter of this document serves to support the lecture presentations and contains a summary in bullet-point style. We advise students to go through these lecture notes immediately after the lecture and to add their own notes to this … Read more →

# 經濟資料視覺化處理

## by 林茂廷, 國立臺北大學經濟學系

Economic data visualization […] This course is designed to develop the skill of efficient graphic language, where efficiency is defined as the data information delivery that is self-contained, concise, and non-distorting. The programming language is mainly based on R, with a little bit of JavaScript toward the end. Though there is no computer programming knowledge required, basic R knowledge will help (the ebook, R for Data Science, would be a good start). By the end of the course, students who learn well should be able to design professional … Read more →

# Population Health Data Science with R

## by Tomás J. Aragón

Population health data science (PHDS). […] We are writing this book to introduce R—a programming language and environment for statistical computing and graphics—to public health epidemiologists, health care data analysts, data scientists, statisticians, and others conducting population health analyses. Recent graduates come prepared with a solid foundation in epidemiological and statistical concepts and skills. However, what is sometimes lacking is the ability to implement new methods and approaches they did not learn in school. This is more apparent today with the emergence of data science … Read more →

# Metode Numerik Menggunakan R Untuk Teknik Lingkungan

## by Mohammad Rosidi

Numerical methods computing using R, with case studies in environmental engineering. This book can serve as a reference for students who want to study the computational process of numerical methods using R in depth. It is written in simple language so that students can follow it easily. […] Metode Numerik Menggunakan R Untuk Teknik Lingkungan is a book the author wrote in the hope that it can become a reference for students, especially in environmental engineering, who are interested in learning R. In addition, this book is one of the author’s ways of giving back by providing sources … Read more →

# Technical Analysis with R (second edition)

## by Ko Chiu Yu

This is an introductory textbook that focuses on how to use R to do technical analysis. […] Since the first edition was published in 2018, I have received numerous comments on how to improve the book. The second edition attempts to accommodate these suggestions as much as possible. The book is completely rewritten and reorganized. In particular, the core part on trading rules is now divided into three chapters: day-trading rules, non-day-trading rules, and complex trading rules. Day-trading rules require only basic knowledge of conditional if statements and for loops. Non-day trading … Read more →

# Recoding Introduction to Mediation, Moderation, and Conditional Process Analysis

## by A Solomon Kurz

This project is an effort to connect Hayes’s conditional process analysis work with the Bayesian paradigm. Herein I refit his models with my favorite R package for Bayesian regression, Bürkner’s brms, and use the tidyverse for data manipulation and plotting. […] Andrew Hayes’s Introduction to Mediation, Moderation, and Conditional Process Analysis text, the second edition of which just came out, has become a staple in social science graduate education. Both editions of his text have been from a frequentist OLS perspective. This project is an effort to connect his work with the Bayesian … Read more →

# Interactive web-based data visualization with R, plotly, and shiny

## by Carson Sievert

A useR guide to creating highly interactive graphics for exploratory and expository visualization. […] This is the website for “Interactive web-based data visualization with R, plotly, and shiny”. In this book, you’ll gain insight and practical skills for creating interactive and dynamic web graphics for data analysis from R. It makes heavy use of plotly for rendering graphics, but you’ll also learn about other R packages that augment a data science workflow, such as the tidyverse and shiny. Along the way, you’ll gain insight into best practices for visualization of high-dimensional data, … Read more →

# R Graphics Cookbook, 2nd edition

## by Winston Chang

This cookbook contains more than 150 recipes to help scientists, engineers, programmers, and data analysts generate high-quality graphs quickly—without having to comb through all the details of R’s graphing systems. Each recipe tackles a specific problem with a solution you can apply to your own project and includes a discussion of how and why the recipe works. […] Welcome to the R Graphics Cookbook, a practical guide that provides more than 150 recipes to help you generate high-quality graphs quickly, without having to comb through all the details of R’s graphing systems. Each recipe … Read more →

# Some Notes About Math

## by 熊逸飞

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation \(a^2 + b^2 = c^2\). The bookdown package can be installed from CRAN or GitHub: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need XeLaTeX. You are recommended to install TinyTeX (which includes XeLaTeX): https://yihui.org/tinytex/. … Read more →

# Multivariate Statistical Analysis using R

## by Theodore Wiebold

One, two, and multiple-table analyses. […] Advice: Use the simplest method that provides the clearest … Read more →

# The Open Quant Live Book

## by OpenQuants.com

The Open Quant Live Book […] The book aims to be an Open Source introductory reference of the most important aspects of financial data analysis, algo trading, portfolio selection, econophysics and machine learning in finance with an emphasis on reproducibility and openness not to be found in most other typical Wall Street-like references. The Book is Open and we welcome co-authors. Feel free to reach out or simply create a pull request with your contribution! See project structure, guidelines and how to contribute here. First published at: openquants.com. Licensed under Attribution-NonCommer … Read more →

# A Minimal Book Example

## by Riccardo Esclapon

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation \(a^2 + b^2 = c^2\). The bookdown package can be installed from CRAN or GitHub: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need XeLaTeX. You are recommended to install TinyTeX (which includes XeLaTeX): https://yihui.name/tinytex/. … Read more →

# Course Handouts for Bayesian Data Analysis Class

## by Mark Lai

This is a collection of my course handouts for PSYC 621 class in the 2019 Spring semester. Please contact me [mailto:hokchiol@usc.edu] for any errors (as I’m sure there are plenty of them). […] This is a collection of my course handouts for PSYC 621 class. The materials are based on the book by McElreath (2016), the brms package (Bürkner 2017), and the STAN language. Please contact me for any errors (as I’m sure there are plenty of them). Bürkner, Paul-Christian. 2017. “brms: An R Package for Bayesian Multilevel Models Using Stan.” Journal of Statistical Software 80 (1): 1–28. … Read more →

# On the Fruit of the Holy Spirit

## by Kevin Morales

This paper examines the role of the Holy Spirit in fructifying. […] This paper examines the role of Galatians chapter five, verses twenty-two through twenty-six in the life of a Christian. The first two verses in consideration speak of the fruit of the Spirit, and further emphasize that, when walking in the Spirit, a Christian is not under the law; the next two verses harmonize the role of Christ’s work on the cross, and the role of the Holy Spirit in leading a Christian. The last verse in consideration is an exhortation to act according to the flesh. No serious study of the Bible is … Read more →

# Utah IR R tools user guide

## by UDWQ Integrated report team

A guide to installing R and the necessary R packages developed for Utah’s Integrated Report process. […] bookdown.org/jakevl/IR-R-tools-guide This guide walks secondary reviewers through installing R and its associated packages on their computer. A snapshot of the commands required to run the R packages is available on the “Install or Access R Tools” card on the Current Report Development Trello Board. Please refer to the FAQs section for answers to common questions, but note that this section will grow as we move through the process as a group. If you still have questions about running R or … Read more →

# Aprender R: iniciación y perfeccionamiento

## by François Rebaudo

A guide to acquiring the basics of programming with R and managing and analyzing your data effectively. […] I thank all the contributors who helped improve this book with their advice, suggestions for changes, and corrections (in alphabetical order): The gitbook, html, and epub versions of this book use the open-source icons from Font Awesome (https://fontawesome.com). The PDF version uses the Tango project icons available from openclipart (https://openclipart.org/). This book was written with the R package bookdown (https://bookdown.org/). The code … Read more →

# Supplement to Longitudinal analysis of early childhood stunting in low-resource settings

## by Jade Benjamin-Chung et al.

This is supplementary information to Longitudinal analysis of early childhood stunting in low-resource settings […] Recommended citation: Benjamin-Chung J, et al. 2020. Longitudinal analyses of early childhood stunting in low-resource settings. Journal Name. doi. This site contains supplementary information to the Longitudinal analyses of early childhood stunting in low-resource … Read more →

# Grand Slam Heroes

## by Ganesh Viswanathan and Roma Dutta

Grand Slam Heroes […] “The Open Era is the current era of professional tennis. It began in 1968 when the Grand Slam tournaments allowed professional players to compete with amateurs, ending the division that had persisted since the dawn of the sport in the 19th century.” - Wikipedia Github Link: https://github.com/ganesh2512/finalProject Rstudio Cloud Link: https://rstudio.cloud/project/704614 Bookdown Link: https://bookdown.org/rdutta4/bookdown-grandslam/ ShinyAppsIOLink: https://ganesh-viswanathan.shinyapps.io/finalProjectShiny/ Data source Link: https://github.com/rfordatascience/tidytues … Read more →

# Data Visualization Design Project

## by Joshua Ganz, Julian Mucha & Eric Rwabuhihi

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Millions of people around the world like to watch football, commonly known as soccer in the United States. It is the most watched sport in Europe, Asia, and Africa. Most recently, the United States has attracted many fans, due in part to the US women’s soccer team, which has won the FIFA Women’s World Cup on four occasions since its inauguration in 1991. The goal of this project is to analyze how teams performed in the world cup in the last 28 years: country … Read more →

# GSL causeway and salinity

## by Jake Vander Laan (jvander@utah.gov), Utah Division of Water Quality

GSL causeway and salinity […] This document contains a series of figures and analyses of Great Salt Lake (GSL) salinity, focused on the effects of causeway culvert closure and subsequent bridge and breach opening. These were developed to provide background information, context, and discussion points for the first GSL Salinity Advisory Committee Meeting. All data presented in this document are publicly available via USGS, UGS, or DWQ websites. Notes: … Read more →

# Teaching and Learning with Jupyter

## by Lorena A. Barba, Lecia J. Barker, Douglas S. Blank, Jed Brown, Allen B. Downey, Timothy George, Lindsey J. Heagy, Kyle T. Mandli, Jason K. Moore, David Lippert, Kyle E. Niemeyer, Ryan R. Watkins, Richard H. West, Elizabeth Wickes, Carol Willing, and Michael Zingale

A handbook on teaching and learning with Jupyter notebooks. […] Lorena A. Barba, Lecia J. Barker, Douglas S. Blank, Jed Brown, Allen B. Downey, Timothy George, Lindsey J. Heagy, Kyle T. Mandli, Jason K. Moore, David Lippert, Kyle E. Niemeyer, Ryan R. Watkins, Richard H. West, Elizabeth Wickes, Carol Willing, and Michael Zingale This handbook is for any educator teaching a topic that includes data analysis or computation in order to support learning. It is not just for educators teaching courses in engineering or science, but also data journalism, business and quantitative … Read more →

# Data Skills for Reproducible Science

## by psyteachr.github.io

This course provides an overview of skills needed for reproducible research and open science using the statistical programming language R. Students will learn about data visualisation, data tidying and wrangling, archiving, iteration and functions, probability and data simulations, general linear models, and reproducible workflows. Learning is reinforced through weekly assignments that involve working with different types of data. Read more →

# R Markdown: The Definitive Guide

## by Yihui Xie, J. J. Allaire, Garrett Grolemund

The first official book authored by the core R Markdown developers that provides a comprehensive and accurate reference to the R Markdown ecosystem. With R Markdown, you can easily create reproducible data analysis reports, presentations, dashboards, interactive applications, books, dissertations, websites, and journal articles, while enjoying the simplicity of Markdown and the great power of R and other languages. Read more →

# Researching and writing for Economics students

## by Dr. David Reinstein, University of Exeter, main web page, innovationsinfundraising.org, Twitter: givingtools

An interactive guide to doing Economics research, mainly aimed at undergraduates; a work in progress […] Note to friends I’ve asked to look at this book… Thanks for looking at this. You can leave feedback however you like, including via ‘hypothes.is’ as suggested just below. Let me know if you want access to the GitHub account. This is a work in progress; I’ve moved most, but far from all, of the content over from my previous notes on this, which date from 2014. This was mainly targeted at the Essex Undergraduate Economics students; roughly 400 a year were required to write dissertations. … Read more →

# Statistical Inference via Data Science

## by Chester Ismay and Albert Y. Kim

An open-source and fully-reproducible electronic textbook for teaching statistical inference using tidyverse data science tools. […] This is the website for Statistical Inference via Data Science: A ModernDive into R and the tidyverse! Visit the GitHub repository for this site, find the book at CRC Press, or buy it on Amazon. This work by Chester Ismay and Albert Y. Kim is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International … Read more →

# Knowing the KPI Trend Without Seeing the Chart

## by Ace Mark Acebedo

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] The skill of charting and consequently reading the KPI trend is one of the most basic things that I had to learn on the job as a Telco Engineer. I am using the skill to evaluate cell performance over a definite observation period (i.e. days, weeks, months, years etc.). Since it is a most basic skill, creating a guideline to do it nowadays is never a matter of serious consideration if not a downright laughable joke. But, just like me, if you’ve become … Read more →

# bookdown: Authoring Books and Technical Documents with R Markdown

## by Yihui Xie

A guide to authoring books with R Markdown, including how to generate figures and tables, and insert cross-references, citations, HTML widgets, and Shiny apps in R Markdown. The book can be exported to HTML, PDF, and e-books (e.g. EPUB). The book style is customizable. You can easily write and preview the book in RStudio IDE or other editors, and host the book wherever you want (e.g. bookdown.org). Read more →

# Data Analysis for Psychology in R (dapR1) - Labs

## by Department of Psychology, University of Edinburgh

This is the page that contains the course lab materials […] Data Analysis for Psychology in R 1 (dapR1) is your first step on the road to being a data, programming and applied statistics guru! This course provides an introduction to data, R and statistics. It is designed to work slowly through the conceptual content that forms the basis of understanding and working with data to perform statistical testing. At the same time, we will be introducing you to basic programming in R, covering the fundamentals of working with data, visualization and simple statistical tests. The overall aim of the … Read more →

# Readings in applied data science

## by Qiushi Yan

Readings in applied data science […] This project is highly motivated and inspired by stats337 at Stanford University offered by Hadley Wickham, and Data Science with R: A Resource Compendium by Martin Monkman. They both provided great reading materials in data analysis with R, or applied data science in general. Here I attempt to finish one or two papers per week, draw a brief summary, and document my personal … Read more →

# 資料科學程式設計–進階

## by 國立臺北大學 林茂廷老師

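This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This course is self-directed: each chapter of the e-book explains which reference materials to read, followed by sets of exercises that test students’ understanding. … Read more →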

# 数据科学与 R 语言

## by 黄湘云

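Data manipulation, data analysis, data modeling, dynamic reports, and data visualization […] This book is still at a very early stage. GNU R is free, open-source software released under GPL-2/3, which means that as long as you comply with the license you are free to obtain, modify, and distribute the R source code; this open and free nature is why you can use R at no cost. Norm Matloff, author of The Art of R Programming, gives four advantages of using R: it was developed by statisticians, for statisticians; its built-in matrix type and matrix operations are very efficient; the plotting packages, whether from base R or from CRAN, provide powerful graphics; and it has excellent parallel capabilities. More recently he also published a comprehensive comparison of R and Python for data science. The web is full of articles comparing R and Python; besides praise there are also dissenting voices, such as objections to R’s use of the GPL, and someone has even listed the top 10 reasons for leaving the R camp. DataCamp provides a fairly complete comparison chart, for reference only. Whether you are a statistics student or a data analyst, I recommend that you first learn … Read more →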

# Summary of Presentation at the International Justice Forum in Nur-Sultan 2019

## by Jesper Wittrup

My presentation in Nur-Sultan. […] Accurate measurement of court workload is an essential component of modern judicial management. Many tasks – including performance measurement, budgeting, allocation of judges and other staff, and assessments of judicial maps - crucially depend upon a proper measurement of judicial workload. Unfortunately, court workload is in many countries still primarily assessed by the total number of cases, not taking into account distinctions between routine cases and very complex and time-consuming cases. This article demonstrates – by reference to a concrete … Read more →

# 现代统计图形

## by 谢益辉, 黄湘云, 赵鹏

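Manuscript of Modern Statistical Graphics […] While writing this book we received contributions from Song Li, JackieMe, and yang, for which we are grateful; we welcome more people to help improve the book. During the migration of the book, much of the code in the original manuscript was updated or replaced. The R version that now accompanies the book is R Under development (unstable) (2019-11-11 r77397), and we have also completed testing with R version 3.6.1. To make it easier for readers to reproduce the computational results and statistical graphics in this book, and to make it easier to automatically test contributors’ PRs on Travis and automatically deploy every committed change, the book’s runtime environment has been packaged as a Docker image hosted on Docker Hub at https://hub.docker.com/r/xiangyunhuang/msg-book. Readers can download it from Docker Hub, or, based on the Dockerfile under the docker/ directory, … Read more →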

# Accessible Web Development

## by Kevin Morales

This book introduces a completely blind person to web development using HTML, CSS, and JavaScript. […] Welcome to Accessible Web Development! To this day, any type of graphics programming, or any kind of software development that is visual in nature, is inaccessible to people with complete vision loss, for obvious and nonobvious reasons. To complicate matters further, text editors are mostly written with sighted programmers in mind. Furthermore, it is estimated that seventy percent of the legally blind population is unemployed. Developers of software libraries write their documentation and … Read more →

# Uber Movement dataset : playing with spatial data

## by Clement Lefevre

Using the Uber Movement dataset, we combine it with the OpenStreetMap data for Berlin. […] For some cities, Uber released datasets of its drivers’ movements. These include the OSM way identifier and the mean and standard deviation of speed. To anonymize them, the data have been aggregated per hour. Let’s have a look at the Berlin data for the month of June 2019, and how they are distributed in space and time. For this, we will combine those data with the OpenStreetMap shapefile for Berlin. Throughout this book, we will use some concepts of data analysis … Read more →

# Book_MI.utf8.md

## by mwheymans

Copyright ©2019 by Heymans and Eekhout All rights reserved. No part of this book may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying, recording, or by any information storage and retrieval system, without permission in writing from the … Read more →

# Modelltheoretische Grundlagen wirtschaftspolitischer Kontroversen

## by mgwkshiny

… […] 1. The empirical reference point of our model world 2. Aggregate demand 3. Private consumption 4. Investment 5. Government demand 6. Goods market equilibrium and the multiplier 7. The IS curve 8. Labor supply, employment, and productivity 9. Inflation: the conflict between wages and profits 10. Economic policy in the New Keynesian 3-equation model 11. From the New Keynesian to the post-Keynesian … Read more →

# 可视化软件工具与应用 课程作业

## by 闫求识

Coursework for Visualization Software Tools and Applications […] This bookdown project contains my code and solutions to the data visualization course https://github.com/haoyuns/Fall-2019. … Read more →

# Réalisation de Mini Projets R

## by content

Site of DIAMBAN Lamine, bookdown::gitbook. […] This blog is a record that brings together some of the projects I carry out in statistics, especially in R and the Tidyverse. As an M2 student in Statistics and Data Science at IM2AG in Grenoble, I have had to complete various projects as part of my training. But also, out of pure passion and in order to better consolidate my knowledge, I like to apply statistical science to my areas of interest. Outside my studies, I have a passion for new technologies, sports, and cinema, particularly short … Read more →

# Untitled

## by Slezák Martin

Untitled […] This R code can be used to render the book from RStudio: bookdown::publish_book(render = … Read more →

# Introducción a R y SIG

## by Paúl Bravo L. & Francisco Salgado C.

Introduction to R and GIS […] In this tutorial guide, we present the key concepts and tools of R and their relationship with Geographic Information Systems. Article “Teach yourself programming in ten years” (Peter Norvig): http://norvig.com/21-days.html A Sufficient Introduction to R: https://dereksonderegger.github.io/570L/A_Sufficient_Introduction_to_R.pdf Introduction to R based on the lessons of the Swirl package. Sean Kross, Nick Carchedi, Bill Bauer, Gina Grdina, Filip Schouwenaars, Wush Wu. R para principiantes: https://cran.r-project.org/doc/contrib/rdebuts_es.pdf RPubs (Daniela … Read more →

# Introduction to R Markdown

## by Michael Clark

This document will introduce participants to the basics of R Markdown. After an introduction to concepts related to reproducible programming and research, demonstrations of standard markdown, as well as overviews of different formats, will be provided, including exercises. […] … Read more →

# Notes on Modern Statistics for the Social and Behavioral Sciences (MSSBS)

## by Peter Baumgartner

This book accompanies “Rand, W. (2017). Modern Statistics for the Social and Behavioral Sciences: A Practical Introduction, Second Edition (New edition). Boca Raton, FL: Taylor & Francis Inc.”. I intend to learn and explore the concepts and procedures taught in this introductory book. Please keep in mind that this book is just a kind of training exercise for me. There will be no new insights presented, and as I am still learning the basics of R and statistics, you may find misunderstandings and errors in my writings. For authoritative reference, you have to consult the original book. Read more →

# Great Salt Lake Nutrient Analyses - figures only

## by Jake Vander Laan, Utah Division of Water Quality

Great Salt Lake Nutrient Analyses - figures only […] This is a figures only version of a set of GSL water quality and nutrient analyses. All data are drawn from USGS NWIS or EPA WQP. For details and code see: bookdown.org/jakevl/gsl-nutrients This project remains in active development and does not represent any official agency position. The first section, ‘Basic analyses & figures’, provides a set of contextual figures meant to characterize spatial and temporal variability of important water quality, lake elevation, and tributary discharge parameters. The second section, ‘Nutrient pools & … Read more →

# Make money with machine learning

## by Siraj Raval, revisited by Kim NOËL

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This book is a personal transcription of the course provided by Siraj Raval. A controversy over content copyright involving Raval arose during this course, and many students, including me, were disturbed. The controversy grew, and it affected the motivation to progress in the course. So I decided to propose a version with more explanations and details. I will provide a list of tutorials to follow in order to complete this course. This is a book written in … Read more →

# Statistical Inference via Data Science

## by Chester Ismay, Albert Y. Kim, and Chris Teplovs

An open-source and fully-reproducible electronic textbook for teaching statistical inference using tidyverse data science tools. […] Please note that this book has been modified by Dr. Chris Teplovs for use in SI 544. This is version 0.6.0.9000 of ModernDive published on August 15, 2019. For previous versions of ModernDive, see Section 1.4. While a PDF version of this book can be found here, this is very much a work in progress with many things that still need to be fixed. We appreciate your patience. This book assumes no prerequisites: no algebra, no calculus, and no prior … Read more →

# An Introduction to Probability and Simulation

## by Kevin Ross

This textbook presents a simulation-based approach to probability, using the Symbulate package. […] Why study probability? Why use simulation to study probability? This book uses the Python package Symbulate (https://github.com/dlsun/symbulate) which provides a user friendly framework for conducting simulations involving probability models. The syntax of Symbulate reflects the “language of probability” and makes it intuitive to specify, run, analyze, and visualize the results of a simulation. In Symbulate, probability spaces, events, random variables, and random processes are symbolic … Read more →

# Bayesian Hierarchical Models in Ecology

## by Steve Midway

This is a book that is built on lectures from a course of the same name. […] Welcome to Bayesian Hierarchical Models in Ecology. This is an ebook that is also serving as the course materials for a graduate class of the same name. There will be numerous and ongoing changes to this book, so please check back. And don’t hesitate to email me if you have questions, comments, or for anything else. To start, let’s clarify the title of this text: it should be Hierarchical Models in Ecology Using Bayesian Inference. A Bayesian Hierarchical Model is more a term of convenience than accuracy, as … Read more →

# Data Science für Klein- und Mittelbetriebe

## by Jürgen Gruber

Big Data, Data Science, and Analytics are the buzzwords of our time. But what lies behind them? Is it only possible for large companies to use the new technologies? This book attempts to give an introduction to the topic, above all from a practical point of view. Use cases are presented simply and comprehensibly, specifically from a business administration perspective. Enjoy the journey of discovery. Read more →

# ECON 41 Labs

## by Gabriel Butler UCLA Global Classroom

ECON 41 Labs […] This book is an R-based statistical programming companion for ECON 41 - Statistics For Economists, an undergraduate course for Economics majors offered at the University of California, Los Angeles. More specifically, it has been created to augment the version of this course that is offered at Jinling High School in Nanjing, Jiangsu, China as part of the Global Classroom program that is part of the UCLA International Institute. More basic information about our program and about me is available at the links below. Go Bruins! 金陵中学中美班，加油！ https://www.international.ucla … Read more →

# Field Guide to the R Ecosystem

## by Mark Sellors

This guide aims to introduce the reader to the main elements of the R ecosystem. […] This work is licensed under a Creative Commons Attribution 4.0 International … Read more →

# 資料科學與R語言

## by 曾意儒 Yi-Ju Tseng

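An introduction to using R to read, process, analyze, and present data, and to integrating big data technologies with R […] This book introduces how to use R for reading data (from files, via APIs, or by web scraping), data cleaning and processing, exploratory data analysis, data visualization, interactive data presentation (with Shiny), and data mining, and introduces ways to interface R with Hadoop Ecosystems. The data mining chapter is not yet complete, and the formatting of the EPUB version is still being fine-tuned. To install all the packages used in this book at once, run the following code in R: This book is used for teaching the Big Data Analytics Methods course in the Department of Information Management at Chang Gung University, and can be used together with the teaching videos on YouTube; for the video playlist, see the book’s final chapter, Ch 13, on the teaching videos. If you would like to revise the text or examples, you are welcome to provide suggestions and feedback via this link or through a GitHub issue. … Read more →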

# Contributing to GNU R

## by Lionel Henry

How to contribute to GNU R. […] Contributing to a large open source project like GNU R can be intimidating. This document aims to reduce the technical barriers to contributing and will help you get started on building R, modifying it, and checking that you haven’t broken anything. At the moment, Contributing is written from the perspective of macOS users, but should be helpful to users of all platforms. R Internals, the language specification. R Installation and Administration, the official guide on how to install R. R’s internal C API, an unofficial documentation for the internal … Read more →

# 数据科学工作流

## by 黄湘云

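Within the tidyverse ecosystem, compile 10 case studies in data analysis and modeling, one case per chapter. The data need to be fairly representative, and the emphasis must be on the guiding role of statistical thinking in data analysis and modeling, covering the process of data collection, storage, exploratory analysis, modeling, and interpretation; for example, scraping data and storing it in a database, followed by EDA, modeling, and presentation of an analysis report. More attention is paid to interpretability and to the connections between statistical learning and machine learning, especially the relationship between traditional mathematical statistics and data mining techniques […] … Read more →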

# Data Science at the Command Line

## by Jeroen Janssens

This is the website for Data Science at the Command Line, published by O’Reilly October 2014 First Edition. This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. To get you started—whether you’re on Windows, macOS, or Linux—author Jeroen Janssens has developed a Docker image packed with over 80 command-line tools. Discover why the command line is an agile, scalable, and extensible … Read more →

# Kursmaterial till Certifierad Data Scientist

## by Ferrologic

This document contains course material and exercises for the R exercises of the first block. […] You do not need any particular prior knowledge to use this material. The exercises and structure follow the book R for Data Science by Hadley Wickham and Garrett Grolemund, which is available for free. That book is an excellent in-depth complement to this … Read more →

# MANUAL PARA LA APLICACIÓN DE INDICADORES DE PRODUCCIÓN CIENTÍFICA EN EL ECUADOR ASOCIADO A SCOPUS

## by Aplicación por: Jeysson Chuquin, Wagner Salazar

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] The aim of this manual is to present the indices of scientific production in Ecuador associated with Scopus. Scientific production in Ecuador is one of the most representative activities in the institutional evaluation criteria, with a weight of 9%. Scientific production is an important factor in determining the quality of an IES (Institución de Educación Superior, a higher education institution) and in determining the growth of a country. This manual presents how … Read more →

# Text as Data para Ciências Sociais

## by Davi Moreira

A compilation of methods and techniques for automated content analysis […] Drawing on the material produced for the course Text as Data: análise automatizada de conteúdo, which I taught at MQ-UFMG in 2019, and on the article I co-authored with Maurício Izumi (Izumi and Moreira 2018), this book aims to spread techniques and methods of automated content analysis using the R language in the social sciences and humanities. The main goal of the book is to be a practical tutorial on the use and application of techniques and methods of automated content analysis in the Portuguese language through … Read more →

# QA of Code

## by Joshua Halls

This is a draft of QA for Coding guidance […] ALPHA This is a draft of Government Statistical Service guidance. It is unpublished and does not represent the views of the ONS or the GSS. … Read more →

# Visualisation with Python

## by Jumping Rivers

Blah […] Jumping Rivers is an analytics company whose passion is data and machine learning. We help our clients move from data storage to data insights. Jumping Rivers has delivered quality data insights from day 1. Based in Newcastle and founded in 2016, the company is bringing a fresh approach to the world of data analytics. Jumping Rivers offer a number of training options in R and Python ranging from introductory programming to applied machine learning, automated reporting and app development. We also offer the facility to create bespoke training programs for our clients to make sure … Read more →

# 課程介紹 | ntpu-datavisualization.utf8.md

## by tpemartin

Economic data visualization […] This course is designed to develop the skill of efficient graphic language, where efficiency is defined as the data information delivery that is self-contained, concise, and non-distorting. The programming language is mainly based on R, with a little bit of JavaScript toward the end. Though there is no computer programming knowledge required, basic R knowledge will help (the ebook, R for Data Science, would be a good start). By the end of the course, students who learn well should be able to design professional … Read more →

# R for Data Science: Exercise Solutions

## by Jeffrey B. Arnold

Solutions to the exercises in “R for Data Science” by Garrett Grolemund and Hadley Wickham. […] If you find any typos, errors, or places where the text may be improved, please let me know. The best ways to provide feedback are by GitHub or hypothes.is annotations. Opening an issue or submitting a pull request on GitHub Adding an annotation using hypothes.is. To add an annotation, select some text and then click the on the pop-up menu. To see the annotations of others, click the in the upper right-hand corner of the page. This book contains the exercise solutions for the book R for Data … Read more →

# Grange-Lab Manual

## by Jim Grange

The Grange-Lab Manual provides information on all you want (or need) to know about working in the Grange Lab. […] … Read more →

# Mál- og tegurfræði

## by Brynjólfur Gauti Jónsson

My notes from the course taught at the University of Iceland […] Week 1: Set theory, the extended real line, series. Week 2: Jordan-measurable sets, step functions, the Riemann integral. Week 3: Lebesgue: the outer measure, measurable sets, and the measure. Week 4: Measure spaces Week 5: Week 6: Week 7: Integrable functions Week 8: Week … Read more →

# R Cookbook, 2nd Edition

## by James (JD) Long, Paul Teetor

Second edition of R Cookbook […] R is a powerful tool for statistics, graphics, and statistical programming. It is used by tens of thousands of people daily to perform serious statistical analyses. It is a free, open source system whose implementation is the collective accomplishment of many intelligent, hard-working people. There are more than 10,000 available add-on packages, and R is a serious rival to all commercial statistical packages. But R can be frustrating. It’s not obvious how to accomplish many tasks, even simple ones. The simple tasks are easy once you know how, yet figuring … Read more →

# R for Statistics in EPH

## by Daniel J Carter

R for Statistics in EPH […] Welcome to R for STEPH. This ‘book’ offers the chance to supplement your learning in Stata by conducting the computer practical sessions in R. By the end of this book, you will have enough proficiency in R to carry out a number of basic analyses and understand principles that will allow you to program more complex analyses. Any questions about the content in this book can be directed to Daniel Carter via email or via Twitter if you’re into that sort of thing. There is also the invaluable resource that is Stack Exchange. Chances are high that if you’re running … Read more →

# R for MSc DH/RSHR/Epi

## by Daniel J Carter

R for MSc DH/RSHR/Epi […] Welcome to the two day Introduction to R for MSc Epi and MSc RSHR. Before you attend the course, you will need to ensure you have your own R setup on your computer. The getting started page will instruct you how to do that. Please do this as early as possible before the course to ensure that you can get help if you encounter any issues. How to interact with this course: Course materials come in three flavours. First, there is the Bookdown file you are reading right now in your web browser - this contains all the code, output, exercise solutions (after the course is … Read more →

# Political Compass in the Random World

## by Tejendra Pratap Singh

On the robustness of the Political Compass […] This is to test how the Political Compass Test performs if it is supplied with randomly selected choices. Theoretically, it should give us zero on the coordinate axis. I will insert the image of the result from the above values inserted in the Political Compass Test. Let’s see what happens. Results can be found at https://www.politicalcompass.org/yourpoliticalcompass?ec=-1.63&soc=-0.87. Although the resulting coordinates are very close to the origin, it looks like the test is made to elicit responses in the third quadrant. I also attach the … Read more →
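
The expectation that random answers land near the origin can be illustrated with a small simulation. This is only a sketch under stated assumptions: the scoring used by the actual Political Compass Test is not described in this excerpt, so the question count, the four answer options, and their symmetric scores of -2, -1, +1, +2 are all hypothetical.

```python
import random
import statistics

# Hypothetical scoring: four answer options, symmetric around zero.
OPTION_SCORES = (-2, -1, 1, 2)


def simulate(n_respondents, n_questions, seed=0):
    """Return one mean axis score per respondent who answers at random."""
    rng = random.Random(seed)
    return [
        sum(rng.choice(OPTION_SCORES) for _ in range(n_questions)) / n_questions
        for _ in range(n_respondents)
    ]


scores = simulate(n_respondents=2000, n_questions=60)
average = statistics.mean(scores)   # centre of the random respondents
spread = statistics.pstdev(scores)  # how far a single random run typically strays

print(f"mean score across respondents: {average:.3f}")
print(f"spread of a single respondent: {spread:.3f}")
```

Under these assumptions the average over many random respondents sits close to zero, but any single random run still scatters around it. One result on its own may therefore not be enough to tell a built-in bias from chance; repeating the random run several times would give a clearer picture.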

# A Minimal rTorch Tutorial

## by Alfonso R. Reyes

This is a minimal tutorial of using the rTorch package to have fun while doing machine learning. This book was produced with bookdown. […] You need two things to get rTorch working: Install Python Anaconda, preferably the 64-bit version with Python 3.6 or later. Install R, Rtools and RStudio. Install rTorch from CRAN or GitHub. Note. It is not mandatory to have a previously created Python environment with Anaconda, where PyTorch and TorchVision have already been installed. This step is optional. You could also get it installed directly from the R console, in very similar fashion as in R-TensorFlow … Read more →

# Computational Genomics with R

## by Altuna Akalin

A guide to computational genomics using R. The book covers fundamental topics with practical examples for an interdisciplinary audience […] The aim of this book is to provide the fundamentals for data analysis for genomics. We developed this book based on the computational genomics courses we are giving every year. We have invariably had an interdisciplinary audience with backgrounds from physics, biology, medicine, math, computer science or other quantitative fields. We want this book to be a starting point for computational genomics students and a guide for further data analysis in more … Read more →

# R Companion to Real Econometrics

## by Tony Carilli

This book looks at the R code necessary to complete the end of chapter exercises in Bailey’s Real Econometrics […] The intended audience for this book is anyone making use of Real Econometrics: The Right Tools to Answer Important Questions 2nd ed. by Michael Bailey who would like to learn the R code necessary to complete the end of chapter exercises. We rely heavily on the tidyverse, a collection of packages that shares an underlying design philosophy, grammar, and data structures. We also make use of a variety of packages (bundles of code) where it will make coding more straightforward in … Read more →

# Rad: R for academics

## by Marius Mather

An accessible introduction to R that doesn’t assume programming experience […] This is an experiment in running some short, informal training sessions to get people started with a programming approach to data management and analysis. We’ll be using R, but a lot of the concepts in R will transfer to other software. There may also be some room to include some info about basic web design or other related topics - some of this can be done through R, and more info can be provided if needed. Training will be pitched at the beginner level - you don’t need to know the difference between a CSS and a … Read more →

# Métodos Cuantitativos

## by Aleksander Dietrichson, PhD

Course material for the course «Metodologías cuantitativas». […] This text was produced in response to the apparent lack of an accessible, modern introductory textbook on quantitative analysis and statistics in Spanish. Although it was conceived as course material for Metodologías cuantitativas, a subject the author teaches at the Escuela de Humanidades of the Universidad Nacional San Martín, it can easily be adapted to introductory statistics courses in … Read more →

# Lab Guide to Quantitative Research Methods in Political Science, Public Policy & Public Administration

## by Joseph Ripberger, Cody Adams, Alex Davis, and Josie Davis

A lab guide to quantitative research methods in R. … Read more →

# Open Forensic Science in R

## by Editor: Sam Tyner, Ph.D.

This book is for anyone looking to do forensic science analysis in a data-driven and open way. Whether you are a student, teacher, or scientist, this book is for you. We take the latest research, primarily from the Center for Statistics and Applications in Forensic Evidence (CSAFE) and the National Institute of Standards and Technology (NIST) and show you how to solve forensic science problems in R. The book makes some assumptions about you: This book is free and is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 United States License. R Core Team. 2019. R: A Language … Read more →

# R Programming for Data Science

## by Roger D. Peng

The R programming language has become the de facto programming language for data science. Its flexibility, power, sophistication, and expressiveness have made it an invaluable tool for data scientists around the world. This book is about the fundamentals of R programming. You will get started with the basics of the language, learn how to manipulate datasets, how to write functions, and how to debug and optimize code. With the fundamentals provided in this book, you will have a solid foundation on which to build your data science toolbox. Read more →

# Introduction to R

## by Filip Wästberg

This book contains workshop material for 4 workshops held at Tele2 during the autumn of 2019 […] During the workshops you will use R on both a cloud platform and on your own computer. If you have a previous R version installed on your computer we recommend that you update both R and RStudio. This book is written in R with the package bookdown built by Yihui Xie. You can find out more about bookdown … Read more →

# Men’s U-Sports Basketball Analysis

## by Michael Armanious

Sports Analytics […] Statistics in sports is a growing field in research that provides specialized methodology for collecting and analyzing sports data in order to make decisions for successful planning and implementation of new strategies [1]. Sports, particularly, have countless and ever-expanding data sources that can be used by analysts in order to extract objective information for use in aspects such as making predictions throughout seasons and enhancements in team and player performance. Broadly, Sports Analysis is described as the process of data management, predictive model … Read more →

# tmdlTools Guide

## by Elise Hinman

A step by step guide to using the R package, tmdlTools […] This guide walks watershed coordinators through installing R and its associated packages on their computer. If you still have questions about running R or the tmdlTools app, Elise Hinman is available and happy to assist you. … Read more →

# 2 Datenherkunft | Making maps with R

## by Nico Hahn

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Most of the data used in this work comes from OpenStreetMap. Depending on their size, the data sets were either created with the accompanying OSM app or downloaded from https://www.geofabrik.de/. In the latter case, the data sets were converted into a usable format and filtered with the command line tools osmconvert and osmfilter. After that, a further transformation into the GeoJSON format was needed, for which the NodeJS package … Read more →

# Applications of Machine Learning in Imputation

## by Vinayak Anand-Kumar

This document presents the findings from the 2018/19 project into the use of machine learning in imputation. […] I would like to acknowledge the following people in helping produce this report: Emily Tew and Gareth Clews for their guidance and support, in getting XGBoost up and running. Fern Leather, for getting CANCEIS to work. Really grateful for taking the time to run through the specification files with me. Luke Lorenzi and Vahe Nafilyan, for helping me put the pieces together, and helping me figure out how we can progress this work in the context of survey data. Editing and … Read more →

# Technical Foundations of Informatics

## by Michael Freeman and Joel Ross

The course reader for INFO 201: Technical Foundations of Informatics. […] Announcement: Starting in 2019, readings for the INFO 201 course will come from the textbook Programming Skills for Data Science, which is available to UW students for free via SafariBooksOnline or in print. Unless specifically directed to a section of this online text, you should refer to the Programming Skills for Data Science textbook. This book covers the foundation skills necessary to start writing computer programs to work with data using modern and reproducible techniques. It requires no technical background. … Read more →

# LDC Walk Through

## by E. Hinman

A step by step guide to calculating loading and building load duration curves […] The loading calculation function is called tmdlCalcs(). It requires a workbook with specific tabs and headers, as well as a command outlining whether the output should be exported from the function. We’ll define the parameters (and load the packages) in this bookdown so you can see the steps in real time. The first step is to define the output object (a list of dataframes), and some of the equations needed. The next step is to read in the workbook and define the parameters contained in the input tab. Then, we … Read more →
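
The general idea behind a load duration curve can be sketched as follows. This is not the tmdlCalcs() function and does not read the workbook described above; the function name, its inputs, the example values, and the conversion factor are hypothetical, and Python is used only for illustration. The sketch ranks flows from highest to lowest, assigns each a percent exceedance (here with the Weibull plotting position, which is an assumption), and multiplies flow by a water quality criterion to obtain an allowable load.

```python
def load_duration_curve(flows, criterion, conversion=1.0):
    """Return (percent_exceedance, allowable_load) pairs, highest flow first.

    flows      -- observed flows in any consistent unit
    criterion  -- water quality criterion expressed as a concentration
    conversion -- hypothetical unit conversion factor supplied by the caller
    """
    ranked = sorted(flows, reverse=True)
    n = len(ranked)
    curve = []
    for rank, flow in enumerate(ranked, start=1):
        # Weibull plotting position: percent of time this flow is equaled or exceeded.
        exceedance = 100.0 * rank / (n + 1)
        curve.append((exceedance, flow * criterion * conversion))
    return curve


example_flows = [12.0, 3.5, 8.0, 20.0]  # made-up values for illustration
curve = load_duration_curve(example_flows, criterion=2.0)
for exceedance, load in curve:
    print(f"{exceedance:5.1f}% exceedance -> allowable load {load:.1f}")
```

Observed loads (measured concentration times flow) could then be plotted against the same exceedance axis to see in which flow regimes the allowable load is exceeded.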

# Data Analytics

## by Hans van der Zwan

Course notes for the Data Analytics course at The Hague University of Applied Sciences; lecturer J.H. van der Zwan […] Ismay C. & Kim A. Y. (2019). ModernDive. Statistical Inference for Data Science. https://moderndive.com. Rumsey D. J. (2010). Statistical Essentials for Dummies. Hoboken: Wiley … Read more →

# e-Business

## by Robert Batzinger

This is a minimal example of the book I am trying to write. The output format for this example is bookdown::gitbook. […] This book attempts to introduce undergraduate students to the nature and requirements for conducting business online. It starts with a discussion of the nature of business and the challenges and potential of the online environment, followed by a review of common methods of modelling business, and a study of open source business solutions. The final chapter focuses on emerging trends and sea-changes in e-Business. This book is currently a work in progress that is also … Read more →

# MSU I-O Student Mentorship Program User Manual

## by Eagle I-O

This is a User Manual for the Montclair State University I-O Psychology student mentorship program. The intended users are: 1) mentors, 2) protégés, 3) Eagle I-O consultants, and 4) MSU I-O program faculty members. […] This manual was written in Bookdown using the GitBook … Read more →

# An Introduction to Game Theory

## by Yuleng Zeng

This is an introduction to Game Theory. The project started when I sat in on Tobias Heinrich’s class (POLI 725: International Conflict) in Fall 2019. I was given the opportunity to provide an introduction to basic game theory concepts and methods. Thanks again to Toby for the trust and the opportunity. My intention is to build upon the short introduction and potentially expand it into a course, with a heavy focus on models used in International Relations. If you have suggestions or find any errors, please do shoot me an … Read more →

# Estadística Básica Edulcorada

## by Alejandro Quintela del Rio

Basic statistics and probability, with applications and historical notes. […] Warning: this book is a work in progress. Copying passages is not recommended, since there could be tears later if accusations of plagiarism arise. Statistics for intelligent people. This book is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The packages or libraries used in this book are listed below. You do not need to install all of them to run individual code chunks but, if running an example ever produces an error, it could … Read more →

# Exploratory Analysis Project

## by Matheus Amaral Mões, Marcelo Semerene Farah, Luísa Belus Henriques and Daniela de Góes N. Georg

This project is part of a lesson from FGV’s MBA on Business Analytics and Big … Read more →

# 国内任天堂 Switch 使用情况与玩家需求调查报告

## by OldNin

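A small survey of Nintendo Switch players in mainland China. […] Nintendo Switch™ is a video game console made by the Japanese company Nintendo, released on March 3, 2017. On April 26, 2019, Tencent announced that it would partner with Nintendo to distribute the Nintendo Switch console in China. On August 2, the Nintendo Switch, presented jointly by Tencent and Nintendo, made its debut at the 17th China Digital Entertainment Expo (ChinaJoy 2019), and that morning a media briefing themed “奇乐同享，‘任’式好游戏” was held at the Pudong Shangri-La hotel in Shanghai. On August 12, 2019, I distributed the questionnaire link through my personal Weibo, Baidu Tieba (the NS and Switch forums, where the posts were unfortunately hidden), QQ groups, WeChat groups, 抽屉新热榜 (where an editor fortunately promoted the post to the hot list), and IT之家 (where a few users replied). On August 13, after the questionnaire link was reposted by media outlets and prominent bloggers such as 知任CHS, NS新闻速报 and 黑桐谷歌, the questionnaire … Read more →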

# Introduction to Econometrics with R

## by Christoph Hanck, Martin Arnold, Alexander Gerber and Martin Schmelzer

Beginners with little background in statistics and econometrics often have a hard time understanding the benefits of having programming skills for learning and applying Econometrics. ‘Introduction to Econometrics with R’ is an interactive companion to the well-received textbook ‘Introduction to Econometrics’ by James H. Stock and Mark W. Watson (2015). It gives a gentle introduction to the essentials of R programming and guides students in implementing the empirical applications presented throughout the textbook using the newly acquired skills. This is supported by interactive programming exercises generated with DataCamp Light and integration of interactive visualizations of central concepts which are based on the flexible JavaScript library D3.js. Read more →

# Polar-ICE Sci-I Project Development & Implementation Guide

## by Kristin Hunter-Thomson and Jacoby Baker

Lessons learned from running the Polar-ICE Sci-I Project for others to take and run with in their locations! […] Welcome to our Science Investigations (Sci-I) Project Development & Implementation Guide. This document provides an overview of how we developed and implemented the Sci-I Project from 2015-18 through the Polar-ICE grant funded by the National Science Foundation Polar Division (Grant #PLR-1525635). Because our funding source was related to the polar regions, we had a polar emphasis of content throughout our implementation of the project. However, the Sci-I Project can be run with … Read more →

# R for Geospatial Processing

## by Nicolas Roelandt

These are the training materials for the R for Geospatial Processing workshop at FOSS4G 2019, Bucharest (Romania). […] This workshop is designed for attendees of FOSS4G 2019, so basic knowledge of GIS is expected (simple features, projections and CRS, geometrical operations, etc.). No knowledge of R is required. A minimal knowledge of (R)markdown will be a plus for taking notes. Please install R on your system and the following libraries. Please follow the installation instructions from the CRAN project. The {sf} library needs several geospatial core libraries (GDAL, GEOS, PROJ) so please follow … Read more →

# 3 Dashboard for students | LASI 2019 - Visualization meets Learning Analytics

## by karepin13

3 Dashboard for students | LASI 2019 - Visualization meets Learning Analytics […] Let’s construct a prototype of a dashboard for one of our students. Discussion: How should we show a student’s performance? Should we show students their predicted chances of success? On the graph below we show the score for each assessment alongside the average score of other students. We can highlight some of the student’s results which differ notably from the average score. Will it be useful or motivating for the student? Maybe we should show that graph to the teacher or assistant (in cases … Read more →

# Analyzing single-case data with R and scan

## by Jürgen Wilbert

This is a book on analyzing single-case data but also on how to do this using the R package scan […] … Read more →

# Quantitative Research Methods for Political Science, Public Policy and Public Administration: 4th Edition With Applications in R

## by Hank Jenkins-Smith, Joseph Ripberger, Gary Copeland, Matthew Nowlin, Tyler Hughes, Aaron Fister, Wesley Wehde, and Josie Davis

Quantitative Research Methods for Political Science, Public Policy and Public Administration: 4th Edition With Applications in R […] The idea for this book grew over decades of teaching introductory and intermediate quantitative methods classes for graduate students in Political Science and Public Policy at the University of Oklahoma, Texas A&M, and the University of New Mexico. Despite adopting (and then discarding) a wide range of textbooks, we were frustrated with inconsistent terminology, misaligned emphases, mismatched examples and data, and (especially) poor connections between the … Read more →

# Introduction to Time Series Analysis and Forecasting in R

## by Tejendra Pratap Singh

Scripts from the online course on Time Series and Forecasting in R. […] Selecting the model. Because the data are seasonal, simple models will not be able to capture the pattern. We therefore use seasonal ARIMA and exponential smoothing models. Exponential smoothing models have seasonality built in by construction. Complex models like mixed models and neural nets would be overkill. … Read more →

# Descenso Internacional del Sella

## by Sergio Berdiales

In this book I am publishing my notes on the Descenso Internacional del Sella. So far I have only explored the times of the overall winners of the race, that is, of the K2 … Read more →

# How to Build a Shiny Application from Scratch

## by Hadrien@rstudio.com

How to Build a Shiny Application from Scratch […] Shiny is a powerful R package which allows you to create interactive web applications using the R programming language. It is particularly useful for creating applications that run on data and include some sort of data analysis or visualization. In addition to leveraging the power of R and its thousands of packages, one of the big benefits of shiny is the ease of developing applications using R only. Although it is possible to incorporate more traditional web design languages such as custom CSS or Javascript into your shiny application, it … Read more →

# A Guide to Reproducible Research

## by Callum Arnold

This is a book that provides the foundations for good project structure and organisation. It guides you in what reproducible research is, and how we can implement it. If you would like to contribute to, and expand upon, sections, please submit a pull request on the GitHub Repo. Equally, please submit pull requests if you spot a typo or a mistake! […] This book’s focus is on how to produce reproducible research, and should serve as an introduction to data management and project organisation. Through the course of this document, we explain techniques that can be employed easily to help add … Read more →

# AWS Tutorial

## by Bingwei Liu

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. … Read more →

# R Installation Guide

## by Integrated Report Team

A step by step guide to installing R and the necessary R packages needed to perform secondary reviews. […] This guide walks secondary reviewers through installing R and its associated packages on their computer. A snapshot of the commands required to run the R packages is available on the “Install or Access R Tools” card on the Current Report Development Trello Board. Please refer to the FAQs section for answers to common questions, but note that this section will grow as we move through the process as a group. If you still have questions about running R or the secondary review apps, the … Read more →

# Dokumentasjon av utdanningsfaglig kompetanse

## by Tormod Bøe Førsteamanuensis

Documentation of educational competence […] In the chapters, indicates that the text is taken from “Dokumentasjonskrav fra UiO”. https://www.uio.no/om/organisasjon/utvalg/utdanningskomiteen/moter/2018/mote-nr-10/rapport-fra-arbeidsgruppen-for-merittering-med-vedlegg.pdf https://www.uhr.no/temasider/karrierepolitikk/opprykksordninger/ https://www.uhr.no/_f/p1/i667c848f-8845-47e9-b8b6-fddfbde6782d/nasjonale_retningslinjer_vurdering_opprykk_professor_psykologi_npp_juni_2015.pdf https://lovdata.no/dokument/SF/forskrift/2018-09-12-1322 … Read more →

# Eagle I.O Consultant Guidelines

## by Eagle I.O

This is the student guideline manual that describes expectations and responsibilities of Eagle I.O consultants. […] This manual was written in Bookdown using the GitBook … Read more →

# Tutorial SIG pour le diagnostic territorial

## by Tristan Berchoux (CIHEAM-IAMM)

GIS tutorial for territorial diagnosis, IAMM. […] Following the lectures, you will now get hands-on with a free and open-source Geographic Information System (GIS) application: QGIS. Throughout the tutorial you will carry out exercises, set apart by a grey background, with corresponding questions that you must answer. The data needed for the tutorial can be downloaded HERE. You are advised to save your project (under a new name) regularly: menu Projet → Sauvegarder sous… The computers of … Read more →

# OcFund QGG海内外基金调研

## by 施旸 Yvette

2019 Intern Report Collection […] Hello, world. … Read more →

# Introductory Resources: Statistics and R

## by Statistics Team, PPLS

This is the main page of the course and contains the materials to help you get going with R […] This course is designed for those who will be joining a third year Research Methods and Statistics (RMS) course and covers a number of introductions to topics which are core to statistical analysis in psychology and beyond. You will find here an introduction to R as a tool to analyse data, visualize it and use it for a very basic analysis of the relationships in your data. It will further revise some of the most commonly used statistical tests and provide you with guidance on how to set up and … Read more →

# An R Exercise in Data Collection, Cleaning, and Merging U.S. Census Data

## by Sean Conner

An R Exercise in Data Collection, Cleaning, and Merging U.S. Census Data […] This document is intended as a follow-along tutorial for learning how to perform data collection and cleaning with R. To the best of my ability, I have tried to make this illustrative of real data and real tasks that anyone from a social science student to a county government official might actually encounter. To that end, I am building upon actual projects that I have worked on as a graduate research assistant to convey this information. For context, previously, I conducted a Mississippi case study of how indoor … Read more →

# Learn RDataTable

## by Vikram Singh Rawat

This book is a guide to the rich world of RDataTable […] R is already a slow language; please don’t defame it by using even slower packages. … Read more →

# DJing to Dolphins

## by Ian K Salter

The 2017 tales of a voyage sailing away from Brexit Britain. […] The tales of a 2017 voyage sailing away from Brexit Britain reflecting on fake news with the help of the philosophies of science and mathematics. To Natalie - for once upon a time, on the banks of the Thames, encouraging me to keep writing. Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. © Ian K Salter 2017, 2018, … Read more →

# Agile Machine Learning with R

## by Edwin Thoen

A workflow for doing machine learning in the R language, using Agile principles. […] Not even too long ago, when I was starting my career as a data scientist, I did not really have a workflow. Freshly graduated from an applied statistics master I entered the arena of Dutch business, employed by a small consulting firm. Neither the company I was with, nor the clients I was working for, nor myself had an understanding of what it meant to implement a statistical model or a machine learning method in the real world. Everybody was of course interested in this “Big Data” thing, so it did not take … Read more →

# Introduction to Data Exploration and Analysis with R

## by Michael Mahoney

A detailed introduction to coding in R and the process of data analytics. Version 1.0.0 […] Welcome to Introduction to Data Exploration and Analysis in R (IDEAr)! This book is designed as a crash course in coding with R and data analysis, built for people trying to teach themselves the skills needed for most analyst jobs today. You won’t need any past experience with R or data analytics - the aim of the book is to work as a primer for people of all backgrounds. This book is currently being continuously deployed to bookdown.org and GitHub while editing continues. This is so that I can get … Read more →

# Data Analysis and Processing with R based on IBIS data

## by Kevin Donovan

Data Analysis and Processing with R based on IBIS data […] Over the course of my time working with the Carolina Institute for Developmental Disabilities (CIDD) and the Infant Brain Imaging Study (IBIS) network, I have seen a great interest in learning how to do basic statistical analyses and data processing among the trainees. Specifically, there is an interest in learning how to use R, due to its popularity across the sciences and its zero financial cost. As a statistician in training, I feel it is a great benefit for scientists to learn R. It is vital for scientists to understand the … Read more →

# jamoviで学ぶ心理統計

## by Danielle J Navarro & David R Foxcroft (authors), 芝田征司 (translator)

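『jamoviで学ぶ心理統計』 is a textbook for introductory statistics classes for psychology majors. It also covers how to use jamovi and how to manipulate data. The statistics part covers descriptive statistics and drawing graphs, then explains probability theory, sampling and estimation, and null hypothesis testing. After the theory, it covers the analysis of contingency tables, correlation, t-tests, regression, and analysis of variance. The final part of the book takes up Bayesian statistics. This book is a Japanese translation of learning statistics with jamovi. […] This book is David Foxcroft’s “Learning … Read more →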

# Common statistical tests are linear models: a work through

## by Steve Doogue

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a reworking of the book Common statistical tests are linear models (or: how to teach stats), written by Jonas Lindeløv. The book beautifully demonstrates how many common statistical tests (such as the t-test, ANOVA and chi-squared) are special cases of the linear model. The book also demonstrates that many non-parametric tests, which are needed when certain test assumptions do not hold, can be approximated by linear models using the rank of values. … Read more →
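
The claim that a t-test is a special case of the linear model can be checked numerically. The sketch below is not taken from the book: the data are made up, and the helper names `pooled_t` and `slope_t` are hypothetical. It compares the classic equal-variance two-sample t statistic with the t statistic of the slope in a regression of the outcome on a 0/1 group indicator.

```python
import math
import statistics

# Made-up measurements for two groups (illustrative only).
group_a = [5.1, 4.9, 6.2, 5.8, 6.0, 5.5]
group_b = [6.8, 7.1, 6.5, 7.4, 6.9, 7.2]


def pooled_t(a, b):
    """Student's two-sample t statistic, assuming equal variances."""
    na, nb = len(a), len(b)
    pooled_var = (
        (na - 1) * statistics.variance(a) + (nb - 1) * statistics.variance(b)
    ) / (na + nb - 2)
    diff = statistics.fmean(b) - statistics.fmean(a)
    return diff / math.sqrt(pooled_var * (1 / na + 1 / nb))


def slope_t(x, y):
    """t statistic for the slope of the least-squares fit y = b0 + b1 * x."""
    n = len(x)
    x_bar, y_bar = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sxx
    b0 = y_bar - b1 * x_bar
    rss = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))
    se_b1 = math.sqrt(rss / (n - 2) / sxx)
    return b1 / se_b1


# Dummy-code group membership: 0 for group A, 1 for group B.
x = [0] * len(group_a) + [1] * len(group_b)
y = group_a + group_b

t_classic = pooled_t(group_a, group_b)
t_linear = slope_t(x, y)
print(f"two-sample t: {t_classic:.6f}, regression slope t: {t_linear:.6f}")
```

With a 0/1 indicator the fitted slope is the difference in group means and the residual variance is the pooled variance, which is why the two statistics agree up to floating-point error.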

# An Introduction to R, LaTeX, and Statistical Inference

## by Yuleng Zeng

An introduction to R for political scientists. […] This is an introduction to R and LaTeX. In compiling this document, several sources have been consulted, including Tim Peterson’s website, Harvard’s Math Prefresher, and the course offered by DataCamp. Make sure that you have a laptop throughout this introduction. Install the following applications, if you haven’t done so. Finally, this document is to be used in-class only. As I (will) mention several times, it borrows and merges a lot of resources online. Also, if you see any mistakes or have suggestions, please do shoot me an … Read more →

# Decision-Driven Data Analytics for Well Placement Optimization in Field Development Scenario - Powered by Machine Learning

## by Peyman Kor

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Submitted in accordance with the requirements for the degree of Master of Science (M.Sc) in Petroleum Engineering, University of Stavanger, Energy Resources Department. The data, source code and algorithm of this thesis can be found in the author’s GitHub. Your feedback and comments will be appreciated, and the author can be reached via LinkedIn or Twitter. This thesis is licensed under Attribution-NonCommercial-ShareAlike 4.0 … Read more →

# RStudio para Estadística Descriptiva en Ciencias Sociales

## by Giorgio Boccardo Bosoni y Felipe Ruiz Bruzzone

RStudio para Estadística Descriptiva en Ciencias Sociales […] In its second edition, this book was edited in RStudio using RMarkdown and compiled with the Bookdown package. It was built on a Linux Mint distribution, specifically release 19.1 “Tessa”, Cinnamon Edition. To avoid some problems arising from Linux kernel updates, R version 3.4.4 was used, although by the publication date R had already released version 3.6. To avoid inconsistencies between some dependencies of … Read more →

# Fatal Force Study Group - Shiny App

## by marwaelatrache

Fatal Force Study Group - Shiny App […] The Fatal Force Study Group (FFSG) was founded at the University of Washington (UW) by Professor Martina Morris. Morris has a strong background in Sociology and Statistics, and after joining an activism group called Not This Time she decided to start investigating fatal encounters with police, along with a group of UW undergraduate students. Since then the group has been joined by Professor Ben Marwick, a UW archaeology professor with a strong interest in statistics and R, as well as several more undergraduate students from both UW and neighboring … Read more →

# Time Series Midterm Review

## by David Josephs

A hopefully helpful guide … Read more →

# PPLS PhD Training Workshop: Statistics and R

## by Anastasia Ushakova and Emma Waterston

This is the main page of the course and contains a course overview, schedule and learning outcomes. […] During this intensive workshop we will cover a number of introductions to topics which are core to statistical analysis in applied research. This will include an introduction to R as a tool to analyse data, visualize it and use it for a very basic analysis of the relationships in your data. We will further revise some of the most commonly used statistical tests and provide you with guidance on how to set up and interpret them in R. We will introduce you to the simple linear model and … Read more →

# 課程大綱 | ntpu-data-visualization.utf8.md

## by tpemartin

Economic data visualization […] This course is designed to develop the skill of efficient graphic language, where efficiency is defined as the data information delivery that is self-contained, concise, and non-distorting. The programming language is mainly based on R, with a little bit of JavaScript toward the end. Though there is no computer programming knowledge required, basic R knowledge will help (the ebook, R for Data Science, would be a good start). By the end of the course, students who learn well should be able to … Read more →

# Feature Engineering and Selection: A Practical Approach for Predictive Models

## by Max Kuhn and Kjell Johnson

A primary goal of predictive modeling is to find a reliable and effective predictive relationship between an available set of features and an outcome. This book provides an extensive set of techniques for uncovering effective representations of the features for modeling the outcome and for finding an optimal subset of features to improve a model’s predictive performance. […] A note about this on-line text: This book is sold by Taylor & Francis Group, who owns the copyright. We will be updating this version as we find errors or typos (see the Errata). The physical copies are sold by Amazon … Read more →

# Hackathon Talento - Reto 2 - Wind Farm

## by Sergio Berdiales, Javier Campos y Manuel Antonio García

Hackathon Talento - Reto 2 - Wind Farm […] This notebook grew out of our participation as a team, on 4 June 2019, in the Machine Learning Hackathon organized by Talento Corporativo and sponsored by EDP, El Comercio, Clustertic and BigML. The competition posed a pair of Machine Learning challenges based on EDP data, for which the BigML tool had to be used to run the models. This notebook covers our work on the second challenge, which is described in section one. During the competition most … Read more →

# Hackathon Talento - Reto 1 - SUNLAB

## by Sergio Berdiales, Javier Campos y Manuel Antonio García

Hackathon Talento - Reto 1 - SUNLAB […] This notebook grew out of our participation as a team, on 4 June 2019, in the Machine Learning Hackathon organized by Talento Corporativo and sponsored by EDP, El Comercio, Clustertic and BigML. The competition posed a pair of Machine Learning challenges based on EDP data, for which the BigML tool had to be used to run the models. This notebook covers our work on the first challenge, which is described in section one. During the competition most of … Read more →

# 課程大綱 | ntpu-programming-for-data-science.utf8.md

## by tpemartin

Programming for Data Science (I) […] E-book URL: https://bookdown.org/tpemartin/ntpu-programming-for-data-science/ E-book with personal annotations: https://via.hypothes.is/https://bookdown.org/tpemartin/ntpu-programming-for-data-science/ gitter chatroom: https://gitter.im/ntpuecon/course-program-for-data-science107-2 This course builds the foundation for being a data scientist, one who masters both data analysis and data engineering. There are two programming languages that will be taught through the course: R and JavaScript. R will serve as the data analysis backend, while … Read more →

# 數量方法（一）

## by 林茂廷老師

E-book for 數量方法（一） (Quantitative Methods I) […] Instructor: 林茂廷. Office: Social Sciences Building 3F01. Office hours: TBA. Phone: 02-86741111 ext. 67170. Email: mtlin@gm.ntpu.edu.tw … Read more →

# 空间广义线性混合效应模型及其应用

## by 黄湘云

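Spatial generalized linear mixed models, Stationary Spatial Gaussian Process, Stan platform, Markov chain Monte Carlo. […] Spatial statistics is a rich field, divided mainly into three areas: geostatistics, discrete spatial variation, and spatial point processes (Cressie 1993). The term geostatistics originally came from the South African mining industry (Krige 1951) and was taken up and developed by Georges Matheron and his colleagues to predict the content and quality of gold deposits. The Spatial Generalized Linear Mixed Model (SGLMM) is widely used in spatial statistics, for example to assess the oil content of core samples and to analyze the spatial distribution of nuclear contaminant concentrations (Diggle, Tawn, and … Read more →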

# University of Calgary ARC Manual

## by Naomi J. Goodrich-Hunsaker, Ph.D.

University of Calgary ARC Manual […] This manual contains all of the code developed to run neuroimaging programs on the University of Calgary Arc high performance computing systems. If you need more general information or further clarification, visit my website at http://biabl.com or email me at naomi.hunsaker@utah.edu. … Read more →

# Proyecto Final: Suicidios en México y EU

## by Chávez Mañón Tania Nayeli, Mendoza Suárez Brenda Grisel, Vargas Mendoza Ana Luisa

Proyecto Final: Suicidios en México y EU […] The motivation for this project is to identify the most significant factors that lead a person to suicide, so that it can be prevented. Country: country; categories: Mexico, United States. Year: year; categories: 1985:2015. Sex: gender; categories: female, male. Age: age; categories: 5-14 years, 15-24 years, 25-34 years, 35-54 years, 55-74 years, 75+ years. Suicides: number of suicides. Population: population. HDI: Human Development Index. GDP_PP: gross domestic product per capita. … Read more →

# RAP Guide for ONS

## by Joshua & Catrin

This is a guide for RAP at the ONS […] The aim of this guide is to provide a guide for RAP at the ONS. This is meant to be a more comprehensive guide for RAP at the ONS, aimed at those newer to coding, and is a supplement to the RAP Companion and the RAP course on Udemy. These materials are excellent and provide an in-depth look at RAP in R. What the point of the guide is, its relationship to the RAP course/companion, any conventions, definitions. Note: we refer to our package on GitLab as a ‘project’ throughout, however this can be interchanged with the term ‘repository’. Terminology - We refer to Git … Read more →

# Lecture Notes voor Beleidsinformatica

## by B. Depaire

These are the lecture notes for the course Beleidsinformatica. […] This document contains the lecture notes for the course Beleidsinformatica (3512), taught at Universiteit Hasselt. Each chapter supports one of the lectures and contains both a bullet-point summary and a collection of the sources on which the lecture is based. We recommend going through these lecture notes shortly after each lecture and supplementing them with your own notes from the lecture. We also recommend consulting the sources for … Read more →

# Preguntas entrevistas Data Science

## by Sergio Berdiales

Preguntas entrevistas Data Science […] In these notes I try to answer various questions that a candidate for a Data Scientist position may encounter in an interview. Many of the questions come directly from articles on this specific topic (links in the ‘02-Referencias’ section), others from my personal experience, and others from contributions by other people. Here I link a Google sheet with the questions I am collecting. If you have an interesting question and want to add it to the list, go ahead. This is the GitHub repository: https://github.com/sergiober … Read more →

# Utah DWQ’s irTools R package: An automated approach to state-wide water quality assessment

## by Jake Vander Laan (jvander@utah.gov), Elise Hinman, & Emilie Flemer, Utah Division of Water Quality

Utah DWQ’s irTools R package: An automated approach to state-wide water quality assessment […] This document provides a background and demonstration of the Utah DWQ IR Team’s re-development and automation of water quality assessment tools. This document consists of three components: 1. A background section describing the objectives, tools, and approach to developing new water quality assessment tools. 2. A full-scale demonstration of the current state of this new toolset via the application of these tools to a subset of water quality parameters from the 2016 IR period of record dataset … Read more →

# Applications of Machine Learning in Imputation

## by Methodology

This document presents the findings from the 2018/19 project into the use of machine learning in imputation. […] Editing and imputation are both methods of data processing. Editing refers to the detection and correction of errors in the data, whilst imputation is a method of correcting errors in a dataset. This document presents findings from work carried out at the Office for National Statistics on the use of machine learning in imputation. The chapters address the following … Read more →

# Juego de Tronos - Explorando sus datos

## by Sergio Berdiales

Juego de Tronos - Explorando sus datos […] The aim of this bookdown book is simply to play around a bit with data from the television series Game of Thrones (HBO). All the code and data used are in this GitHub repository: https://github.com/sergioberdiales/game_of_thrones. You can send me any questions, complaints or suggestions via Twitter: twitter.com/SergioBerdiales … Read more →

# From Madrid to Santiago de Compostela, 2019

## by Robin and Katy

Photobook of our trip from Madrid to Santiago via Salamanca, Ourense and the Camino de Compostela. […] Welcome to our photobook of our travels through Spain in May … Read more →

# Monte Carlo Simulation Examples

## by Mark Lai

Handout for the workshop ‘Advancing Quantitative Science with Monte Carlo Simulations’. […] We know that, based on the CLT, under very general regularity conditions, when the sample size is large, the sampling distribution of the sample mean will follow a normal distribution, with mean equal to the population mean, \(\mu\), and standard deviation (which is called the standard error in this case) equal to the population SD divided by the square root of the fixed sample size. Let \(\bar X\) be the sample mean, then \[\bar X \sim \mathcal{N}\left(\mu, \frac{\sigma^2}{N}\right)\] Let’s imagine a … Read more →
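
The CLT statement above lends itself to a quick Monte Carlo check. The sketch below is not from the handout: the exponential population, the sample size, the number of replications, and the function name `simulate_sample_means` are all assumptions chosen for illustration. It draws many samples from a skewed population and compares the spread of the sample means with the theoretical standard error, the population SD divided by the square root of the sample size.

```python
import random
import statistics


def simulate_sample_means(n, reps, seed=1):
    """Return `reps` sample means, each from `n` draws of an Exponential(1)
    population (population mean 1, population SD 1)."""
    rng = random.Random(seed)
    return [
        statistics.fmean(rng.expovariate(1.0) for _ in range(n))
        for _ in range(reps)
    ]


n = 30                      # size of each sample
pop_mean, pop_sd = 1.0, 1.0
means = simulate_sample_means(n=n, reps=5000)

empirical_se = statistics.stdev(means)   # SD of the simulated sample means
theoretical_se = pop_sd / n ** 0.5       # sigma / sqrt(N)

print(f"mean of sample means: {statistics.fmean(means):.3f} (population mean {pop_mean})")
print(f"empirical SE: {empirical_se:.3f}, theoretical SE: {theoretical_se:.3f}")
```

A histogram of `means` would look close to a normal curve even though the population itself is strongly skewed; increasing `n` tightens the approximation.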

# R 数据分析指南与速查手册

## by 郭晓

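This is 郭晓’s R data analysis notebook. […] 郭晓 holds a master’s degree in electronics from Peking University (class of 2018). A data analysis practitioner, continually improving in R and machine learning, with an interest in organizing, mining, and applying knowledge. Contact: xiaoguodata@126.com … Read more →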

# Seeing through the developping lens:

## by Paul Langard

Seeing through the developping lens: […] Through this project, we aim to decipher the post-transcriptional regulation network in the developing lens. In the past decades, post-transcriptional gene regulation (PTGR) was shown to be of particular importance in the developing lens. Indeed, alteration of the PTGR network can result in abnormal development of the lens and of the eye. For example, mutations in RNA binding proteins such as Celf1, Stau2, and Tdrd7 have been associated with eye defects in animal models. Mutation in the RNA binding protein Tdrd7 was associated with juvenile cataract in humans and … Read more →

# Overview of suicide in the world

## by Axel-Cleris Gailloty

Overview of suicide in the world […] According to the World Health Organization (WHO), 800,000 people committed suicide in 2018. This means every 40 seconds a person dies by suicide. This number is fortunately dropping. In this kernel I want to explore the evolution of the suicide rate using the dataset provided here on Kaggle. I’ll be using the powerful R language for this analysis; my main focus is to understand what causes the suicide rate to decrease. Let’s start by loading the packages we’ll be using throughout this study. Before we go further in this analysis, it is important to know what each … Read more →

# Calidad del aire en Gijón

## by Sergio Berdiales

Air quality in Gijón […] The main objectives of this project are to analyze and visualize data from the official air quality monitoring stations of the city of Gijón. This project is a sibling of this other one, https://bookdown.org/sergioberdiales/tfm-kschool_gijon_air_pollution/, which was my final project for the Master in Data Science at Kschool (which is why some parts of the code are commented in English). In it, besides processing the data and carrying out various visualization exercises on it (see the visualizations on Tableau Public), I carried out … Read more →

# Основы обучаемых алгоритмов интеллектуальных систем

## by Митрохин Максим Александрович

This study guide includes a set of laboratory assignments on building machine learning algorithms to solve practical problems. The publication contains the necessary theoretical background on data analysis methodology and the algorithms used. The assignments assume the use of the Python 3.5 programming language. The laboratory course was prepared at the Department of Computer Engineering and is intended for students in degree programs 09.03.01 and 09.04.01 taking the courses “Fundamentals of Intelligent Systems”, “Intelligent … Read more →

# The Good Loser

## by Peter Esaiasson, Sveinung Arnesen, and Hannah Werner

The Good Loser […] This is the analysis report for the Good Loser Project by Peter Esaiasson, Hannah Werner, and Sveinung Arnesen. The study comprises three survey-embedded experiments: one video vignette experiment in Norway, one text vignette experiment in Sweden, and one conjoint experiment in Norway. The study was presented at the Barcelona-Gothenburg-Bergen workshop on Experiments in Political Science in 2018, and will be presented at the 2019 Conference of the Midwestern Political Science Association in Chicago, USA. About Study I – Swedish vignette: TBA About Study II – … Read more →

# R for marketing students

## by KU Leuven Marketing department

KULeuven R tutorial for marketing students […] In this tutorial, we will explore R as a tool to analyse and visualise data. R is a statistical programming language that has rapidly gained popularity in many scientific fields. The main difference between R and other statistical software like SPSS is that R has no graphical user interface. There are no buttons to click. R is run entirely by typing commands into a text interface. This may seem daunting, but hopefully by the end of this tutorial you will see how R can help you to do better statistical analysis. So why are we using R and not one … Read more →

# Lab Guide to Quantitative Research Methods in Political Science, Public Policy & Public Administration.

## by josiesmith

Lab Guide to Quantitative Research Methods in Political Science, Public Policy & Public Administration. […] This book is a companion to Quantitative Research Methods for Political Science, Public Policy and Public Administration (With Applications in R): 4th Edition, an open-source textbook that is available here. It grew from our experiences teaching introductory and intermediate quantitative methods classes for graduate students in Political Science and Public Policy at the University of Oklahoma. We teach these courses using a format that pairs seminars on theory and statistics with … Read more →

# DSBA-5122 Final Project

## by Nicholas Occhipinti, Karyn Cook, Ziyin Liu

The final report for DSBA-5122 Final Project […] For our project we explored data related to opioids, in an effort to better understand and obtain insight into the opioid epidemic. Our domain problem is one for a researcher wanting to explore the connection between prescriber rates of opioid prescriptions and opioid-related deaths both in the country as a whole and drilling down to the state level. The first part of the data we examined was prescriber data. This data would allow the researcher to see the distribution of opioid prescriptions across the US and also find the most commonly … Read more →

# DWQ’s irTools package: An automated approach to water quality assessment

## by Jake Vander Laan, Elise Hinman, & Emilie Flemer, Utah Division of Water Quality

DWQ’s irTools package: An automated approach to water quality assessment […] This book provides a background and demonstration of the Utah DWQ IR Team’s re-development and automation of water quality assessment tools. This book consists of two components: 1. A background section describing the objectives, tools, and approach to developing new water quality assessment tools, and 2. A full-scale demonstration of the current state of this new toolset via the application of these tools to the 2016 IR period of record dataset (2008-2014). The source code for this book is available via GitHub … Read more →

# Tank Guide

## by Marina Wiebe

This is a book regarding how to take care of my tank […] So, you’ve been tasked with taking care of your girlfriend’s hobby tank. It’s a pretty thing and it looks easy enough, but what’s all involved? This book will give you an idea of the tank buddies, tools, and … Read more →

# Building Web Applications with Shiny and SQL Server

## by Matthew Sharkey

A guide to building scalable Shiny Database applications […] This book supplements my presentation at the Omaha R User Group on Thursday, April 4, … Read more →

# Dissertating with RMarkdown and Bookdown

## by thea_knowles

A preliminary tutorial led by Thea Knowles for the R-Ladies #LdnOnt workshop series. Last updated: … Read more →

# Applied Social Network Analysis in Education

## by chen

This is a course handbook written by Bodong Chen for his SNA course at UMN. […] This site is the course portal of CI 8371 - Applied Social Network Analysis in Education, taught by Prof. Bodong Chen at the University of Minnesota in Spring ’19. Content on this site is actively built and refined throughout the semester. This site or book is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Last update: 2019-04-23 … Read more →

# Technical Analysis with R

## by Ko Chiu Yu

This is an introductory textbook that focuses on how to use R to do technical analysis. […] R is widely used in statistical computation. It is well suited to computationally heavy financial analysis, in particular evaluating the performance of trading rules based on technical indicators. Moreover, R can be a one-stop solution for the whole data analysis procedure. A standard procedure of financial data analysis is: You can do all of them inside R without using other software. This short book is an introduction to using R and RStudio for financial data analysis from the beginning. … Read more →

# MODELING MELODIC DICTATION

## by David John Baker

This dissertation models both individual and musical features that contribute to processes involved in melodic dictation. […] All students pursuing a Bachelor’s degree in Music from universities accredited by the National Association of Schools of Music must learn to take melodic dictation (NASM 2019, sec. VIII.6.B.2.A). Melodic dictation is a cognitively demanding process that requires students to hear a melody, then without any access to an external reference, transcribe the melody within a limited time frame. As of 2019, there are 643 Schools of Music belonging to the National Association of … Read more →

# The twinetverse

## by John Coene

A guide to visualise networks of Twitter interactions in R using the twinetverse. […] The goal of the twinetverse is to provide everything one might need to analyse and visualise Twitter interactions, from data collection to visualisation. The following pages will walk you through the packages contained within the twinetverse, from collecting Twitter data to building various types of networks to visualising them. The ’verse focuses on ease of use and interactivity. The source code for this book can be found on GitHub. You can suggest edits to this book by highlighting a section of text and … Read more →

# jamoviクイックガイド（Analysis編）

## by 芝田 征司

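This book is a quick guide to jamovi’s Analysis mode. It gives brief explanations of the menu items in jamovi’s Analysis mode and their settings. […] This book is a quick guide to using jamovi’s Analysis mode. It gives brief explanations of the items in the menus and their settings. Most of the content is based on the reference for jmv, the package for running the same analyses as jamovi in R. This book was written not as a statistics textbook but as a help resource for (students who struggle with English) running analyses in jamovi. Already having some knowledge of statistics itself, or in a statistics class or textbook … Read more →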

# 一个简单的PTE教程

## by 邱睿

A simple PTE tutorial […] … Read more →

# BIOL3360 - Analysis and Communication of Biological Data:

## by janengelstaedter

This online textbook contains learning material for the UQ (The University of Queensland) course BIOL3360: Analysis and Communication of Biological Data. This book is organised with each chapter corresponding to lectures from the Mathematical Modelling component of the course. This book contains many code chunks that can be copied and pasted into an R console to create Shiny apps of the models being discussed. Content and figures were created by Jan Engelstädter. The online version, including the Shiny apps, was created by Nicole Fortuna. Read more →

# Bookdown

## by Arredondo Sánchez Andrea Elizabeth, Vargas Mendoza Ana Luisa

Yihui Xie is a software engineer at RStudio and the author of several packages, including knitr, blogdown, xaringan, tinytex, and bookdown. He has also contributed to important packages such as Shiny and RMarkdown. He has published books such as “bookdown: Authoring Books and Technical Documents with R Markdown”, on which we will rely to discuss Bookdown. Bookdown is an R package that helps us combine multiple R Markdown documents into a single file in HTML, PDF, … format. This file can be a user manual, our study notes, or even our diary. In it we can add and edit … Read more →

# Steem Handbook

## by the Steem Chinese community (collective work)

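Steem Handbook […] Writing and maintaining this book requires long-term contributions; the door is always open, and you are welcome to join us. You can: contribute content the book is missing; revise existing content; fix typos; do any other work related to writing the manuscript. See Appendix 16 for how to submit contributions to the book project. Editor-in-chief: @dapeng. Deputy editor: @maiyude. Advisors (in alphabetical order): @deanliu @jademont @lemooljiang @oflyhigh @rivalhw @sweetsssj @tumutanzi. Scriptwriter: @maiyude. Cover design: @maiyude. The authors, editors, and proofreaders of each chapter are listed in the chapter footnotes. Once the manuscript is complete, the names will be collected here. … Read more →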

# The Good Loser – Results from Three Survey Experiments

## by sveinungarnesen78

The Good Loser – Results from Three Survey Experiments […] This is the analysis report for the Good Loser Project by Peter Esaiasson, Hannah Werner, and Sveinung Arnesen. The study comprises three survey-embedded experiments: one video vignette experiment in Norway, one text vignette experiment in Sweden, and one conjoint experiment in Norway. The study was presented at the Barcelona-Gothenburg-Bergen workshop on Experiments in Political Science in 2018, and will be presented at the 2019 Conference of the Midwestern Political Science Association in Chicago, USA. About Study I – Swedish … Read more →

# Data Visualization in R

## by Brooke Anderson

Online booklet for conference workshop on data visualization with R, geared to those who have never used R. […] I have based this workshop on examples for you to try yourself, because you won’t be able to learn how to program unless you try it out. I’ve picked example data that I hope will be interesting to Navy and Marine Corps public health researchers and practitioners. You can download the slides from the workshop by clicking here. To try out these examples, you need some set-up: This section will walk you through each step. R is free and open-source software. You can download a copy for … Read more →

# From my lovers and others. (Letters from 2013-2014)

## by Carlos Alcalá

This is a compendium of the letters written, sent and received from October 2013 to September 2014. […] Memory, knowledge, lives, even identities, all is distributed. We are socially fragmented. It could be used as an argument for a non-local consciousness theory. Therefore, with this text, I am just trying to compile pieces of what I have been in order to know a bit better what I am now. I have been told in the past that there is wisdom in the text that I wrote, and I am certainly sure that there is wisdom in the texts that I received. I hope you find something that makes your life … Read more →

# Machine Learning

## by Michael Clark

This document provides an introduction to machine learning for applied researchers. While conceptual in nature, demonstrations are provided for several common machine learning approaches of a supervised nature. In addition, all the R examples, which utilize the caret package, are also provided in Python via scikit-learn. […] … Read more →

# A Short Course on Nonparametric Curve Estimation

## by Eduardo García Portugués

A Short Course on Nonparametric Curve Estimation. MSc in Applied Mathematics. EAFIT University (Colombia). […] This course is intended to provide an introduction to nonparametric estimation of the density and regression functions from, mostly, the perspective of kernel smoothing. The emphasis is placed on building intuition behind the methods, gaining insights into their asymptotic properties, and showing their application through the use of statistical software. The software employed in the course is the statistical language R and its most common IDE (Integrated Development Environment) … Read more →

# Data Science avec R

## by Fousseynou Bah

Data Science with R […] When I decided to write a book on data science, I debated at length in my own head and asked myself several questions, one of which kept coming back: “Do we really need another book on data science?” “Don’t we have enough of them?” Given the success the discipline enjoys, resources are certainly not lacking, online or in bookshops. And above all, I wondered, “What did I have to say that had not already been said?” And yet a few reasons led me to reconsider my position. The first is rather selfish. … Read more →

# Predictive Soil Mapping with R

## by Tomislav Hengl and Robert A. MacMillan

Predictive Soil Mapping aims to produce the most accurate, most objective, and most usable maps of soil variables by using state-of-the-art Statistical and Machine Learning methods. This book explains how to implement common soil mapping procedures within the R programming language. […] This is the online version of the Open Access book: Predictive Soil Mapping with R. Pull requests and general comments are welcome. These materials are based on technical tutorials initially developed by the ISRIC’s Global Soil Information Facilities (GSIF) development team over the period 2014–2017. This … Read more →

# PhD Training Workshop: Statistics in R

## by Anastasia Ushakova and Milan Valasek

This is the main page of the course and contains a course overview, schedule and learning outcomes. […] During this intensive workshop we will cover a number of introductions to topics which are core to statistical analysis in applied research. This will include an introduction to R as a tool to analyse data, visualize it and use it for a very basic analysis of the relationships in your data. We will further revise some of the most commonly used statistical tests and provide guidance on how to set them up and interpret them in R. Lastly, we will introduce you to simple linear model … Read more →

# Advanced R Course

## by Florian Privé

This contains materials for the Advanced R course of the doctoral school of Grenoble, France (2018). […] This material is licensed under the Creative Commons Attribution-ShareAlike 3.0 License. Florian Privé is a PhD student in predictive human genetics, fond of Data Science and an R(cpp) enthusiast. He is also the founder and co-organizer of the Grenoble R user group. You can find him on Twitter and GitHub as @privefl and on Stack Overflow as F. Privé. … Read more →

# Utah TDS wqTools vignette

## by Jake Vander Laan, Utah Division of Water Quality

Utah TDS wqTools vignette […] This vignette shows an example of using wqTools functions to extract and analyze statewide patterns of one water quality parameter, total dissolved … Read more →

# Chapitre 4 Importer des données dans R | Data Science avec R

## by Fousseynou Bah

Chapter 4 Importing data into R | Data Science with R […] In the data scientist’s workflow, importing data is very often the starting point. Data are not always available in a format suited to the desired analysis. They may exist in an Excel workbook in xls, xlsx, or csv format. They may also sit in a relational database, where various tables are linked to one another. They may even be available on the Internet (a Wikipedia page, Twitter, Facebook, etc.). In any case, it falls to the data … Read more →

# Minimal-Git-demo

## by PoMingChen

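This is a minimal example of the Git service through GitHub and GitHub Desktop. […] 小瑜 is a student majoring in a social science and humanities field who often has to write reports for coursework, frequently running from several thousand to over ten thousand words. Below is how he manages his files; he admits that sometimes he drives himself crazy... Maybe you sometimes do the same XD (screenshot) Later, by chance, he came across Git, a program that lets developers do version control. After starting to use Git, he got twice the result with half the effort, and tracking and managing file contents became much easier. Let’s take a look at what makes Git so great! Slightly different from other teaching materials, this book starts with a relaxed introduction to the GitHub platform, builds up version control concepts through hands-on work with the graphical (GUI) GitHub Desktop, and, in its explanations, tries to use scenario-based exercises to imagine what Git can … Read more →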

# Generalized Additive Models

## by Michael Clark

An introduction to generalized additive models (GAMs) is provided, with an emphasis on generalization from familiar linear models. It makes extensive use of the mgcv package in R. Discussion includes common approaches, standard extensions, and relations to other techniques. More technical modeling details are described and demonstrated as well. […] … Read more →

# An Incomplete Solutions Guide to the NIST/SEMATECH e-Handbook of Statistical Methods

## by Ray Hoobler

Analysis of case studies and exercises with a focus on using the tidyverse and ggplot2. This handbook was created using the bookdown package in RStudio. The output format for this example is bookdown::gitbook. […] Exploratory Data Analysis (EDA) is a philosophy on how to work with data, and for many applications, the workflow is better suited for scientists and engineers. As scientists, we are trained to formulate a hypothesis and design a series of experiments that allow us to test the hypothesis effectively. Most data, however, doesn’t come from carefully controlled trials, but from … Read more →

# The Status Quo Bias in Referendums

## by Sveinung Arnesen, Troy S. Broderstad, Mikael P. Johannesson, Jonas Linde

This is an analysis report of a comparative conjoint study on the legitimacy of EU referendums. […] This is the analysis report for the conjoint experiment of the Wiggle room study by Sveinung Arnesen, Troy S. Broderstad, Mikael P. Johannesson, and Jonas Linde. The experiment was fielded in France, Germany, Iceland, Norway, Sweden, and the Netherlands as part of the 2017 European Internet Panel Study (EIPS), a collaboration between six European probability-based online survey panels. The 2017 joint survey wave was fielded in France by L’étude longitudinale par internet pour les … Read more →

# Practical R Package Development (Japanese)

## by Hiroaki Yutani

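Practical R Package Development […] R package development is covered in detail in “R Packages” (Hadley Wickham, 2015), but R package development has changed in many ways over the past few years. Fortunately, that book is due to be substantially rewritten for its second edition, so wise R package developers would do well to wait for it. This book was dashed off on impulse as a mere stopgap until then, or rather as the author’s own notes. It skips the basics of R package development and focuses on newer topics. For reliable knowledge, please refer to “R Packages”. Because this book focuses on what is not in “R Packages”, it may not be very suitable for beginners … Read more →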

# «Волопас и Северная Корона»

## by Dmitry Gorodnichy

Tales and Songs of Dmitry Gorodnichy […] Long, long ago, when neither you nor I, nor even my great-great-grandmothers and great-great-grandfathers existed, when indeed there were no people at all, only the Sky and the Stars, there lived a prince named Boötes and a princess named Corona Borealis. They did not know each other, lived in different lands, and spoke different languages. But they had one thing in common: they both loved beauty, and music as a part of that beauty. – Why did they have such unusual names? And what happened next? – Oh, that is a very long and very beautiful story, or rather many different stories. But … Read more →

# Mixed Models in R

## by Michael Clark

This is an introduction to mixed models in R. It covers many of the most common techniques employed in such models, and relies heavily on the lme4 package. The basics of random intercepts and slopes models, crossed vs. nested models, etc. are covered. Discussion includes extensions into generalized mixed models and realms beyond. […] … Read more →

# Entre le terre et le ciel

## by ΔΓ

by Dmitry Gorodnichy […] Songs of today and tomorrow (Songs from the earth and from the sky). In Ukrainian, Russian, French, and English. Winter Blues From above … Read more →

# «Детский альбом»

## by Дмитрий Городничий

“Children’s Album” […] Music and lyrics: Дмитрий Городничий, except 6 (lyrics and music: Роксана Городничая), 16-17 (lyrics: Афанасьева), 21 (music: Гедике), 22 (music: Глинка). Print version: PDF, EPUB. Online: https://bookdown.org/gorodnichy/amour (working version: http://IVIM.ca/dg/amour). Listen on Soundcloud: https://soundcloud.com/dmitry-gorodnichy/sets/amour. The book was published using bookdown. Source on GitHub. Dmitry Gorodnichy, IVIM Inc. © 1992-2012. All Rights Reserved. The use of copyrighted material for non-commercial purposes is allowed. The reference to the source is … Read more →

# «Le long de la voie lactée» (Love songs

## by Дмитрий Городничий

«Le long de la voie lactée» (Love songs […] Songs of unrequited love. In Ukrainian, Russian, French, and English. Music and lyrics: Дмитрий Городничий. Also taking part in the recording of the album: Андрей Городничий, Екатерина Лаврентьева, Людмила Городничая. Print version: PDF, EPUB. Online: https://bookdown.org/gorodnichy/amour (working version: http://IVIM.ca/dg/amour). Listen on Soundcloud: https://soundcloud.com/dmitry-gorodnichy/sets/amour. The book was published using bookdown. Source on GitHub. Dmitry Gorodnichy, IVIM Inc. © 1992-2012. All Rights Reserved. The use of … Read more →

# «По ту сторону горизонта»

## by Дмитрий Городничий

Farewell songs (by Dmitry Gorodnichy) […] This album brings together songs born of the feelings of parting: parting with home, with youth, with a loved one, with an era. They span several periods, from my earliest songs, written when I was parting with the institute where I studied and with my first student friends, to songs devoted to my departure from home for faraway Canada, and finally the latest ones, which poured out in the pain of losing family and loved ones. Several rare family videos are included here, as are several performances at Russian schools and at singer-songwriter festivals. Music and lyrics: Дмитрий … Read more →

# What does the plant do?

## by Otho Mantegazza

A Planter’s Punch that quickly got out of hand […] I wrote this booklet a couple of years ago, while I was working at CEPLAS. – Plants collect energy from sunlight and use it to produce fruits that we eat, fibers that we wear and much, much more; in a process called photosynthesis. This process is fundamental for our life on Earth, and it has been intensively studied for centuries by scientists. Scientists like me, like us. Here I’ll give you a glimpse of our scientific research. After a very short introduction to photosynthesis, I’ll explain to you one of its details and one of the methods … Read more →

# Data Science con R: Fundamentos y Aplicaciones

## by BEST: Behavioral Economics & Data Science Team

The best data science book in Spanish, free and open. […] Note: The book is under development. This book was prepared by BEST. A few years ago the term Data Science was not so well known or widely used by the international community, and even less so locally (Peru). In fact, it was a term rarely used by statisticians and some members of the scientific computing community. Our society has evolved, and with it certain needs. Data Science is here to stay, and in any profession (economists, psychologists, biologists, … Read more →

# APS 135: Introduction to Exploratory Data Analysis with R

## by Dylan Z. Childs

Course book for Introduction to Exploratory Data Analysis with R (APS 135) in the Department of Animal and Plant Sciences, University of Sheffield. […] This is the online course book for the Introduction to Exploratory Data Analysis with R component of APS 135, a module taught by the Department of Animal and Plant Sciences at the University of Sheffield. You can view this book in any modern desktop browser, as well as on your phone or tablet device. Dylan Childs is running the course this year. Please email him if you spot any problems with the course book. You will be introduced to the R … Read more →

# Broadening Your Statistical Horizons

## by Julie Legler and Paul Roback

An applied textbook on generalized linear models and multilevel models for advanced undergraduates, featuring many real, unique data sets. It is intended to be accessible to undergraduate students who have successfully completed a regression course. Even though there is no mathematical prerequisite, we still introduce fairly sophisticated topics such as likelihood theory, zero-inflated Poisson, and parametric bootstrapping in an intuitive and applied manner. We believe strongly in case studies featuring real data and real research questions; thus, most of the data in the textbook arises from collaborative research conducted by the authors and their students, or from student projects. Our goal is that, after working through this material, students will develop an expanded toolkit and a greater appreciation for the wider world of data and statistical modeling. Read more →

# Machine Learning with Rust

## by Tae Geun Kim

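This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Machine learning has recently been growing steadily in importance. Trained machines have easily beaten professionals at Go and other games, and they handle research and work tasks far more efficiently. However, if you start hastily just because everyone else is doing it, you may be unable to interpret your results, or may get wrong results in the first place. In this book, therefore, rather than simply using a machine learning framework, we aim to learn machine learning by applying the theory step by step from the ground up. To that end, we use the Rust programming language and Bishop’s very famous … Read more →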

# Bilingual Christmas songs

## by gorodnichy

A compilation of Christmas song favourites for multi-lingual families, with chords […] … Read more →

# 7 Agrupación de la información | Estadística y Machine Learning con R

## by Francisco Parra

7 Grouping information | Statistics and Machine Learning with R […] Both dimension reduction techniques and clustering techniques are based on determining the existing resemblance (proximity, similarity) or disparity (distance, dissimilarity): between variables in the former case, between individuals or variables in the latter. The first thing to decide, then, is whether to focus the analysis on measuring disparity or resemblance, which will depend largely on the objectives of the research. Another question to consider when choosing one measure or another is … Read more →

# UPR-PRISE Data Science Workshop 01/26/2019

## by Felix E. Rivera-Mariani, PhD

This manual is part of the data science workshop titled The GPS of Data Analytics: Making the Witness (the Data) Confess. The output was produced with bookdown::gitbook. […] Welcome to the data science workshop titled The GPS of Data Analytics: Making the Witness (the Data) Confess. In this workshop, sponsored by the University of Puerto Rico Ponce Research Initiative for Scientific Enhancement, students will learn and implement different aspects of data science, from establishing a set of tools necessary to carry out data science to deploying statistical models through coding, … Read more →

# CFPS 之R语言学习笔记

## by 王敏杰

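A simple example of a Chinese book. […] Hello! My original intention was to record the process of studying the China Family Panel Studies (CFPS) dataset from the Peking University Open Research Data Platform, and also to help fellow students quickly reproduce related research using R. The book is organized as follows: Chapter 1 introduces CFPS, Chapter 2 covers research on rural land transfer, Chapter 3 covers labor mobility, household income, and rural human capital investment, Chapter 4 covers social capital and targeted poverty alleviation, and then so on and so forth… The research in each chapter is independent of the others, so you can read them and run the code separately. I used two R packages to compile this book: knitr (Xie 2015) and bookdown (Xie 2018). Below is my R session information: Many thanks to so-and-so and so-and-so for their help. Gosh, if it weren’t for these godlike teammates, I would have finished this book two years ago. Xie, Yihui. 2015. Dynamic Documents with R and Knitr. 2nd ed. Boca Raton, Florida: Chapman and Hall/CRC. … Read more →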

# Gijón Air Pollution - An exercise of visualization and forecasting

## by Sergio Berdiales

Gijón Air Pollution - An exercise of visualization and forecasting […] My name is Sergio Berdiales and I am a Data Analyst with more than ten years of experience in Customer Experience and Quality areas. If you want to know more about me or contact me, you can visit my LinkedIn profile or my Twitter account. This is my final project for the Kschool Master on Data Science (8th edition). The main objective of this project is to show that I can apply the knowledge acquired during the master’s course in a practical way. The Master on Data Science of Kschool is a 230-hour course which includes Python … Read more →

# Supplement to Shiny in Production

## by kellobri.github.io

This document is full of supplemental resources and content from the Shiny in Production Workshop delivered at rstudio::conf 2019. … Read more →

# Ian and Molly’s Odyssey

## by Ian K Salter

The story of a small journey in Autumn 2016. […] This very short book documents a voyage through France taken in the autumn of 2016. Its two participants head south in their car Dot. The purpose of their journey is … Read more →

# Learning statistics with R: A tutorial for psychology students and other beginners. (Version 0.6.1)

## by DJ Navarro

Learning Statistics with R covers the contents of an introductory statistics class, as typically taught to undergraduate psychology students, focusing on the use of the R statistical software. The book discusses how to get started in R as well as giving an introduction to data manipulation and writing scripts. From a statistical perspective, the book discusses descriptive statistics and graphing first, followed by chapters on probability theory, sampling and estimation, and null hypothesis testing. After introducing the theory, the book covers the analysis of contingency tables, t-tests, … Read more →

# Visualization

## by Stanford Data Lab

This is a book on data visualization using ggplot2 created for the Stanford Data Challenge Lab. […] This is a … Read more →

# Statistical Rethinking

## by Brynjólfur Gauti Jónsson

These are solutions from the book by Richard McElreath. … Read more →

# Referentiekaarten

## by Laura Moria

Reference maps […] For the Water Framework Directive (Kaderrichtlijn Water, KRW), a large share of surface water has been designated as water bodies. A water body is a “discrete surface water of significant size, such as a lake, a river, or a canal”. For these waters, the state of the aquatic ecosystem must be described. Surface waters of “significant size” include water bodies with a minimum area of 0.5 km² or a catchment area between 10 and 100 km². The figure below shows the KRW water bodies in the AGV management area. In the Water Framework Directive (KRW) a … Read more →

# Ecologische sleutelfactoren in beeld

## by Laura Moria

We bundle the information on the current state and development of the aquatic ecosystem in the Amstel, Gooi en Vechtstreek region in a so-called Atlas of thematic maps. Our objectives and the current ecological quality are depicted in the figures below. Another page (Waterkwaliteit in beeld) contains maps of various indicators of ecological quality for the Habitats Directive (Natura2000) and the Water Framework Directive. In addition to the development of the ecological state, the processes that determine this state are also depicted in the chapter below … Read more →

# Waterkwaliteit in beeld

## by Laura Moria

Waterkwaliteit in beeld […] We bundle the information on the current state and development of the aquatic ecosystem in the Amstel, Gooi en Vechtstreek region in a so-called Atlas of thematic maps. Our objectives and the current ecological quality are depicted in the figures below. Chapter 2 contains maps of various indicators of ecological quality for the Habitats Directive (Natura2000) and the Water Framework Directive. In addition to the development of the ecological state, the processes that determine this state are also depicted on another page … Read more →

# Economía Conductual: Fundamentos y Aplicaciones

## by BEST: Behavioral Economics & Data Science Team

The best book in Spanish on behavioral economics, free and open. […] Understanding the behavior of people or of society is a fascinating topic. Some centuries ago, the prophet Isaiah wrote: Behavioral economics addresses this topic from a scientific perspective, supported by psychology and economics. The book consists of 4 parts. Part I covers the introductory material. Chapter I, Adam Smith, Padre de la Economía Conductual, focuses on the origins of economics and of behavioral economics, both fields having Adam Smith as their father. The … Read more →

# Comparing Social Dynamics of a Rental and Purchased Block

## by Jolene Quek

Comparing Social Dynamics of a Rental and Purchased Block […] … Read more →

# Tidyverse Cookbook

## by Malte Grosser

Simple cookbook for functions and idioms within the scope of the tidyverse. […] The basic idea of this book is to provide a documentation of the tidyverse written in a solution driven cookbook style. As an extra I would like to provide similar solutions based on base R functionality. Some reasons to write this book: One strength of the tidyverse is that it hides a lot of quirks that base R provides and inherits to many packages that rely on it. This allows to stick to a specific workflow from the point you enter the tidyverse until you leave it. This is why I highly recommend to head your … Read more →

# Big data and Social Science

## by Paul C. Bauer

Script for the seminar ‘Big Data and Social Science’ at the University of Bern. […] The present document serves both as slides and script for the workshop/seminar Big Data and Social Science. This seminar is taught by Paul C. Bauer at the University of Bern (Fall Semester 2018). The material was developed by Paul C. Bauer and heavily draws on material developed by Pablo Barberà in courses such as Social Media & Big Data Research, Big Data Analysis in the Social Sciences and Automated Collection of Web and Social Data. Any original material and examples is licensed under a Creative Commons … Read more →

# Base de datos corporativa de personas

## by Fernando Izco

Documentation of the proof of concept for the corporate person database […] This document mainly describes the proof of concept carried out as part of feasibility study 22173 EV - Base de datos corporativa de personas. This first chapter is an executive summary. For more depth without getting lost in technical details, see also chapter 2, ‘Concepto de Solución’. The remaining chapters describe the proof of concept, with the technical detail of the implementation. Atariak eta Ezagutza Kudeatzeko Atala / Sección de Portalización y Gestión del … Read more →

# A First Course on Statistical Inference

## by Isabel Molina Peralta and Eduardo García Portugués

# POSEIDON tutorial

## by Ernesto Carrella

This is a basic tutorial on how to use POSEIDON and set it up to explore basic fishery problems. I try to cover everything that does not require changing any of the Java code […] This is a simple tutorial on using POSEIDON, a fishery agent-based model. You can read more about this project by reading its main paper or looking at the code repository. This guide will not explain or require any analysis of the java code. I try here to simply show what can be done by just using the graphical user interface and basic text … Read more →

# Arboles de decision y Random Forest

## by Johanna Orellana Alvear - johanna.orellana@ucuenca.edu.ec

Arboles_de_decision_y_Random_Forest […] “The key to artificial intelligence has always been the representation.” —Jeff Hawkins Here are the course details. Here are the data we will use during the course. Brief recap of R (Chapter 2): RStudio environment and help (0.2 h); directories, scripts and libraries (0.3 h); basic and compound data types (0.5 h); reading and writing files (0.5 h); indexing (1 h); subsetting (1 h); functions (0.5 h). Decision Trees - part I, Decision Trees - part II, Random Forest - part I, Random Forest - part … Read more →

# Advanced Spatial Modeling with Stochastic Partial Differential Equations Using R and INLA

## by Elias T. Krainski, Virgilio Gómez-Rubio, Haakon Bakka, Amanda Lenzi, Daniela Castro-Camilo, Daniel Simpson, Finn Lindgren and Håvard Rue

Advanced Spatial Modeling with Stochastic Partial Differential Equations Using R and INLA […] This book grew out of a tutorial written by Elias T. Krainski, which he started in 2013 together with his PhD-studies at NTNU, Trondheim, Norway. The tutorial has since then been expanded continuously, based on response from the many users and based on new developments. Lindgren, Rue, and Lindström (2011) describe an approximation to continuous spatial models with a Matérn covariance that is based on the solution to a stochastic partial differential equation (SPDE). This approximation is computed … Read more →

# Notes for ST463/ST683 Linear Models 1

## by Katarina Domijan, Catherine Hurley

These are the notes for the ST463/ST683 Linear Models 1 course offered by the Mathematics and Statistics Department at Maynooth University. This module is offered as part of the MSc in Data Science and Data Analytics. It is an introductory course for students who have a basic background in Statistics, Data analysis, R Programming and linear algebra (matrices). […] There are many good resources, e.g. Weisberg (2005), Fox (2005), Fox (2016), Ramsey and Schafer (2002), Draper and Smith (1966). We will use Minitab and R (R Core Team 2017). To create this document, I am using the bookdown package … Read more →

# Escritura de libros con bookdown

## by Fernández-Casal, R. y Cotos-Yáñez, T.R.

This book is an introduction to the bookdown package for writing books (in Spanish, Galician, …). […] This book is a short guide on how to use the R package bookdown for writing books, including some configuration details for writing in languages other than English (Spanish, Galician, …). This book itself was written in R-Markdown using the bookdown package and is available in the GitHub repository: rubenfcasal/bookdown_intro. To build (compile) the book, it may be advisable to install the latest version of RStudio and the version … Read more →

# Computational Communication Science mit R

## by André Calero Valdez

This book is currently a work in progress. […] This book aims to provide an overview of computer-based methods in communication science and to summarize the most important content in the form of a textbook. For all topics covered in this book, better-suited books already exist that treat the relevant theories, methods and techniques in detail and at length. These sources are referenced where appropriate. The central idea behind this book is the unification of the research process and digital support through … Read more →

# New statistics for the design researcher

## by Martin Schmettow

A statistics book for designers, human factors specialists, UX researchers, applied psychologists and everyone else who works hard to make this world a better place. […] This book makes the following assumptions: Chapter @ref(design_research) introduces a framework for quantitative design research. It carves out the basic elements of empirical design research, such as users, designs and performance and links them to typical research problems. Then the idea of design as decision making under uncertainty is developed at the example of two case studies. Chapter @ref(bayesian_statistics) … Read more →

# R 语言分析 LI-6400 和 LI-6800 光合仪的数据

## by 祝介东 北京力高泰科技有限公司

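Analyzing LI-6400XT and LI-6800 data with R […] The model used in the plantecophys package is FvCB, the C3 plant model established by Farquhar, Caemmerer, and Berry (1980), which is based on three stages of the carbon reactions in C3 plants: Catalyzed by ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco), ribulose-1,5-bisphosphate (RuBP) is carboxylated with CO2 to produce 3-phosphoglycerate (PGA). With adenosine triphosphate (ATP) and reduced nicotinamide adenine dinucleotide phosphate (NADPH), PGA is reduced to triose phosphate (TP). One of every 6 TP is exported to the cytosol for sucrose or starch synthesis. The remaining 5 TP regenerate 3 RuBP with the help of ATP. Part of the regenerated RuBP is oxidized, catalyzed by Rubisco, into PGA and 2-phosphoglycolate; 2-phosphoglycolate forms PGA with the help of ATP and releases CO2 (photorespiration). Under illumination, the net photosynthetic rate (A) of C3 plants … Read more →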

# Ecologische waterkwaliteit Botshol

## by lauramoria

Ecologische waterkwaliteit Botshol […] As water manager, AGV is responsible for ensuring that the waters in its management area meet the water quality objectives of the European Water Framework Directive (KRW) and the objectives formulated in the Natura2000 management plan. The ultimate goal of these directives is clean and healthy water. With sufficient stoneworts, pondweeds and … Read more →

# Mastering DFS Analytics

## by M. Edward (Ed) Borasky

Mastering DFS Analytics is a data-driven program to improve your daily fantasy sports results. You’ll learn and much more. Written by an applied mathematician, Mastering DFS Analytics will give you contest-tested tools. In addition to the ebook, you get Comments? Questions? @znmeb_dfs on Twitter Mastering DFS Analytics by M. Edward (Ed) Borasky is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Mastering DFS Analytics on … Read more →

# Tidy Portfoliomanagement in R

## by Sebastian Stöckl

First try on a book on tidy Portfolio Management in R. […] This book should accompany my lectures “Research Methods”, “Quantitative Analysis”, “Portfoliomanagement and Financial Analysis” and (to a smaller degree) “Empirical Methods in Finance”. In the past years I have been a heavy promoter of the Rmetrics tools for my lectures and research. However, in the last year the development of the project has stagnated due to the tragic death of its founder Prof. Dr. Diethelm Würtz. It therefore happened several times that code from past semesters and lectures has stopped working and no more support … Read more →

# Lösningar i R till vissa uppgifter från övningskompendierna

## by Erik Stenberg

Solutions to selected exercises in the course Statistik A4/A8 […] This document is for those of you who are taking the course Statistik A4/A8 and are curious about R. The content is meant to combine a little usefulness (solving exercises) with pleasure (learning some R). This document is not intended to serve as a comprehensive introduction to the R programming language. There are plenty of very well-written guides online that focus much more on how the language is structured. Fortunately, R is very easy to get started with, and not much understanding of the language itself is required to do simple calculations, … Read more →

# 文科生数据科学上手指南

## by 王树义

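You have probably often heard people say that the barrier to entry for technology is getting lower. Data science, machine learning, natural language processing, neural networks, artificial intelligence… the stream of terms dazzles you and fills you with excitement about this era. You are eager to try, hoping to get hands-on and use new technology to produce effective work yourself. But if you are not an IT student, and worse still a liberal arts student, you may gradually discover that the world of technology is not so friendly. You just want to extract topics from text, and someone writes you a formula this long: You want to forecast a time series, and someone tells you that a single processing unit has a structure like this: Besides quickly going “from getting started to giving up”, what else can you do? Don't worry, this is not the truth. The truth is that as long as you know how to find the right toolkit, a few short lines of code can do what used to take days of manual work. Don't believe it? Take a look at this article of mine … Read more →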

# Clustered Data

## by Michael Clark m-clark.github.io

This document provides a brief comparison of various approaches to dealing with clustered data situations. […] … Read more →

# Graphical & Latent Variable Modeling

## by Michael Clark m-clark.github.io

This document focuses on structural equation modeling. It is conceptually based, and tries to generalize beyond the standard SEM treatment. It includes special emphasis on the lavaan package. Topics include: graphical models, including path analysis, bayesian networks, and network analysis, mediation, moderation, latent variable models, including principal components analysis and ‘factor analysis’, measurement models, structural equation models, mixture models, growth curves, item response theory, Bayesian nonparametric techniques, latent dirichlet allocation, and more. Read more →

# R 语言入门，给一心只有学习的你

## by Chris Qi from Data Maniac

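This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Readers who want to get started right away can skip this part and begin with installing the software. If the software is already installed, you can jump to chapter 2. Readers who like to read a book from beginning to end are welcome to start here. Seeing this title, did you think I would ramble on about the history of a piece of software? That sort of thing is fine to hear once; writing it out is a waste of paper and ink. Oh, this is an e-book, so no paper or ink, but typing takes effort too. So here I will just give a rough introduction: R is a language for statistical computing and graphics, developed from the S language and known for its statistical analysis capabilities. R is from New Zealand's Ross Ihaka and Robert … Read more →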

# An Introduction to Text Processing and Analysis with R

## by Michael Clark m-clark.github.io

This document covers a wide range of topics, including how to process text generally, and demonstrations of sentiment analysis, parts-of-speech tagging, word embeddings, and topic modeling. Exercises are provided for some topics. […] … Read more →

# IsoriX: Isoscape Computation and Inference of Spatial Origins using R

## by The IsoriX core Team

This book is the official documentation for the R package IsoriX. […] This is the new documentation of the R package IsoriX, which aims to replace the former vignettes and will ultimately provide much more information than before. Chapters 1 to 5 are almost complete, but you will have to wait for the other chapters to follow. … Read more →

# Cod Prediction

## by Liam

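Predicting order refusal by customers who pay cash on delivery […] Cash on delivery is a payment method that is very popular with users. For customers, cash on delivery is safer; especially in regions where e-commerce is underdeveloped, it effectively dispels users' distrust of online shopping. For merchants, cash on delivery hampers cash flow, and some people do not pay once the goods arrive, that is, they refuse delivery. There are many reasons for refusal; the simplest is that the customer no longer wants the item. In general, the refusal rate for cash on delivery can be as high as 20%, which creates substantial operating costs. This article therefore uses machine learning methods to predict whether a user will refuse delivery. … Read more →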

# Data Visualization with R

## by Rob Kabacoff

A guide to creating modern data visualizations with R. Starting with data preparation, topics include how to create effective univariate, bivariate, and multivariate graphs. In addition, specialized graphs including geographic maps, the display of change over time, flow diagrams, interactive graphs, and graphs that help with the interpretation of statistical models are included. Focus is on the 45 most popular graph types. The guide also includes detailed instructions on how to customize graphs, and ends with a chapter on graphing best practices. Although strongly based on the ggplot2 package, other approaches are included as well. Read more →

# An Introduction to R and LaTeX

## by Yuleng Zeng

An introduction to R for political scientists. […] This is an introduction to R and LaTeX. In compiling this document, several sources have been consulted, including Tim Peterson’s website, Harvard’s Math Prefresher, and the course offered by DataCamp. Make sure that you have a laptop throughout this introduction. Install the following applications, if you haven’t done so. Finally, this document is to be used in-class only. As I (will) mention several times, it borrows and merges a lot of resources online. Also, if you see any mistakes or have suggestions, please do shoot me an … Read more →

# R para principiantes

## by Juan Bosco Mendoza Vega

An introductory book on R, aimed at people with no prior experience with programming languages. […] Purpose of the book: R para principiantes aims to be introductory material for the R programming language, aimed at people who have never used R or any other programming language and have no prior knowledge of probability and statistics. The purpose of this book is for you to acquire the fundamentals of using R as a programming language, from its most elementary concepts to defining functions and generating graphics. It is not an objective of this book that … Read more →

# Basic Social Justice Orientations scale testing

## by Cristóbal Moya

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] The original publication has three tables where ALLBUS-2014 is used: The following analyses are focused on the first two tables (Table 15 and Table 8), because they contain the main results regarding this data source and the table from supplementary materials should not matter as long as factor loadings in Table 8 are correct. The descriptive statistics of the eight items displayed in Table 15 of the original article are reproduced from the article’s website … Read more →

# TensorFlow 学习笔记

## by Leo Van | 范叶亮

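TensorFlow 学习笔记 […] This work is a set of study notes on the TensorFlow deep learning framework; the reference materials consulted include: This work is built with the Bookdown extension package for R. The online version is hosted at https://bookdown.org/leovan/TensorFlow-Learning-Notes ; for the offline version, please download it from the hosting site. Some icons used in this work come from the Papirus icon set. The PDF build of this work uses the LaTeX template krantz.cls provided by Chapman & Hall, with Alegreya as the English serif font, Helvetica as the English sans-serif font, Source Han Serif SC as the Chinese serif font, Source Han Sans SC as the Chinese sans-serif font, Kaiti SC as the Chinese italic font, Sarasa Mono SC as the Chinese and English monospaced font, and Latin Modern Math as the math font. This work uses … Read more →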

# Introducción a estadística con R

## by Matias Andina

This book introduces statistical concepts using R. It is mainly aimed at students who wish to apply and expand their statistical knowledge using a programming language. However, users who have some experience with R and want to venture into broadening their statistical knowledge may find the more advanced chapters useful. […] R is perhaps the most developed language for exploratory data analysis and statistics. Because it has a dynamic, free, open-source nature and a community that works … Read more →

# R语言忍者秘笈

## by 谢益辉, 肖楠, 坑主三, 坑主四

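To be honest, I am not quite sure what this book is going to be about. In a bit of a daze, I feel that writing about some arcane tricks would be fun, a bit of self-amusement for programmers; if, beyond that, readers can pick up some useful skills, all the better. […] The R language (R Core Team 2018) is a programming language invented by statisticians, and this particular background may make it look strange to computer professionals: loose syntax, lax data structures, full of black magic, and so on. Viewed against the background of data analysis, however, it turns out to have many subtle strengths. You cannot learn a language by spending two days reading through the syntax and calling it done; you have to practice in real work, first to consolidate the syntax and second to gain experience. This book is compiled from nearly six thousand posts and thirty thousand replies over six years on the 统计之都 forum (http://cos.name/cn/), with the authors' personal experience added. We looked for the following kinds of posts: … Read more →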

# Bayesian Basics

## by Michael Clark

This document provides an introduction to Bayesian data analysis. It is conceptual in nature, but uses the probabilistic programming language Stan for demonstration (and its implementation in R via rstan). From elementary examples, guidance is provided for data preparation, efficient modeling, diagnostics, and more. […] … Read more →

# Field Epidemiology with R

## by Tomás J. Aragón

A book example for a Chapman & Hall book. […] The document format “R Markdown” was first introduced in the knitr package (Xie, 2015, 2018) in early 2012. The idea was to embed code chunks (of R or other languages) in Markdown documents. In fact, knitr supported several authoring languages from the beginning in addition to Markdown, including LaTeX, HTML, AsciiDoc, reStructuredText, and Textile. Looking back over the five years, it seems to be fair to say that Markdown has become the most popular document format, which is what we expected. The simplicity of Markdown clearly stands out among … Read more →

# A short course on Survival Analysis applied to the Financial Industry

## by Marta Sestelo

This is a short course on survival analysis applied to the financial field. […] This book is designed to provide a guide for a short course on survival analysis. It is mainly focussed on applying the statistical techniques developed in the survival field to the financial industry. The emphasis is placed on understanding the methods, building intuition about when to apply each of them and showing their application through the use of statistical … Read more →

# Advanced Statistical Computing

## by Roger D. Peng

The book covers material taught in the Johns Hopkins Biostatistics Advanced Statistical Computing course. I taught this course off and on from 2003–2016 to upper level PhD students in Biostatistics. The course ran for 8 weeks each year, which is a fairly compressed schedule for material of this nature. Because of the short time frame, I felt the need to present material in a manner that assumed that students would often be using others’ software to implement these algorithms but that they would need to know what was going on underneath. In particular, should something go wrong with one of … Read more →

# Understanding Work With Data in Summer STEM Programs Through An Experience Sampling Method Approach

## by Joshua M. Rosenberg

This is Joshua Rosenberg’s dissertation […] Data-rich activities provide an opportunity to develop core competencies in both science and mathematics identified in curricular standards. Perhaps even more importantly, work with data puts learners in the position to use data to ask and answer questions, a potentially empowering capability. Research on work with data has focused on cognitive outcomes and the development of specific practices at the student and classroom levels, and yet, little research has considered learners’ engagement. The present study explores learners’ engagement in work … Read more →

# Introducción a la Computación con GPUs usando R

## by Ronald Gualán Saavedra

Review of key concepts in GPGPU computing, and some simple examples of using GPU-accelerated libraries […] GPUs (Graphics Processing Units) are processing units originally designed to process graphics on a computer quickly. This is done by having a large number of simple processing units for massively parallel computation. The idea of general-purpose GPU computing (GPGPU) is to exploit this capability for general computation. This tutorial will review … Read more →

# HPC con R para Investigadores

## by Johanna Orellana Alvear - johanna.orellana@ucuenca.edu.ec

HPC con R para Investigadores […] “Programmers waste enormous amounts of time thinking about, or worrying about, the speed of noncritical parts of their programs, and these attempts at efficiency actually have a strong negative impact when debugging and maintenance are considered.” — Donald Knuth. Optimizing code to make it faster is a process … Read more →

# Macroeconomics

## by Mau-Ting Lin

This is a collection of the discussion lists from Macroeconomics. […] The theory contents will follow 1 closely. Item 2 is for data visualization. And item 3 is for general discussion regarding world news. https://goo.gl/kbQwP5 Class participation and quizzes: 10% Midterm Exam: 30% Final Exam: 30% Others R http://www.r-project.org/ RStudio http://rstudio.org/ Github desktop https://desktop.github.com/ … Read more →

# Data Visualization Project

## by Chiayi Yen

Data Visualization Project […] This study aims at investigating how changes in the information dissemination process affect the window-dressing behaviors of mutual fund managers. By convention, window-dressing is defined as the portfolio manipulations right before the quarter-end date, when all the fund managers are required to disclose their holding firms of that date. Over the past decades, technological progress has largely changed how information disseminates, and this further influences the information flow of capital markets. For example, the implementation of “Electronic … Read more →

# «Two Lives» by Concordia Antarova

## by gorodnichy

«Two Lives» by Concordia Antarova: text translation and analysis […] This work presents the working draft of the English translation of the book “Two Lives” by Concordia Antarova. Widely known in Russian-speaking spheres, and translated into French, this book remains largely unknown to the English-speaking population, despite its significant spiritual importance, comparable to that of “Book of Joy”. While the efforts on translating this book into English continue, here the draft of it is used for Artificial Intelligence (AI) projects, aiming at building the systems for automated analysis … Read more →

# «Кубатура Шара»

## by Андрей Городничий

Poetry of Andrey Gorodnichy […] Print version: PDF, EPUB. Online: https://bookdown.org/gorodnichy/andre. … Read more →

# Foundations of Statistics with R

## by Darrin Speegle

This book is written for the purposes of teaching STAT 3850 at Saint Louis University. […] This is a book on probability and statistics suitable for the sophomore or junior level at university. We assume knowledge of calculus at the level of Calculus II. We do not assume prior experience with statistics or programming, though students who have no experience with either statistics or programming before starting this class should expect to have to work hard. We will be using R as an integral part of the exposition — you should not read this book without first getting R Studio installed. We … Read more →

# Thucydides the Neorealist?

## by J.W.Biggs

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Thucydides has long been viewed as an early exemplar of realist thinking in International Relations Theory. More recently, neorealist authors have claimed that Thucydides’ History offers insights into the importance of the anarchy in shaping interstate relations, and should be recognised as a neorealist. This neorealist appropriation has met substantial criticism and many revisionist scholars have urged a re-examining of Thucydides. This dissertation serves … Read more →

# 圖書推薦系統

## by 林凱浩

Master's thesis […] E-book URL: GitHub repo: https://github.com/tpemartin/thesis-book-recommendation bookdown user guide, LyX, register for RSconnect, bookdown::publish_book() or bookdown::publish_book(account="account name") … Read more →

# 通識課程推薦系統

## by 蔡國梁

Master's thesis […] E-book URL: GitHub repo: https://github.com/tpemartin/thesis-GE-recommendation bookdown user guide, LyX, register for RSconnect, bookdown::publish_book() or bookdown::publish_book(account="account name") … Read more →

# 網頁外掛功能：GA,Share,Comments

## by 國立臺北大學經濟學系-經濟時事與多媒體出版

Mini course […] E-book URL: https://bookdown.org/tpemartin/minicourse-webplugins/ First you must: In Atom: click privacypolicy.html and replace the following two items with your own information: https://your_website_url your_email … Read more →

# Hello Py: Python 程式設計

## by Pyradise

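Pyradise is a team focused on teaching Python, dedicated to sharing learning experience and promoting data science and artificial intelligence, so that more people can take part in this wave of learning about information technology and AI. 帽子哥: a developer focused on technology and passionate about teaching, who hopes to convey more ideas through teaching. An enthusiast of data science and continuing education who enjoys long-distance running and table tennis in his spare time; winner of the Big Data category of the 2017 iT邦幫忙鐵人賽. Front-end engineer and designer. … Read more →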

# Introduction to Digital Currency

## by J.W.Biggs

A summary of research conducted hitherto. […] This is research I have conducted for personal use. Using the bookdown package has enabled me to piece together my research in a quick and neat manner. I have tried to convey complex terms as simply as possible utilizing visual examples where I can. Constructive criticism is welcomed - I will regularly be updating this … Read more →

# 認識 R 的美好

## by 郭耀仁

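I am 郭耀仁, an enthusiast of data science and continuing education who enjoys using R and Python for data science applications. I teach several R and Python courses at 台大資工系統訓練班 and also work with companies to provide customized in-house training. I am also the editor-in-chief of DataInPoint, a superb Chinese-language data science column, which has an affiliate marketing partnership with DataCamp, the Boston data science teaching team. If you have needs related to R, Python, data science, teaching, projects or consulting, you can reach me by email: tonykuoyj@gmail.com R is a high-level statistical programming language. It ranked 6th in the 2017 IEEE survey, the highest position among programming languages whose main purpose is data analysis. Other well-known ones include Matlab, ranked 15th, SQL, ranked 23rd, Julia … Read more →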

# R for Social Scientists

## by Paul C. Bauer, Rudolf Farys

Script for an R course at the European University Institute. … Read more →

# Meu log de leitura de R for Data Science

## by Marcos V. C. Vital - LEQ-UFAL

Meu log de leitura de R for Data Science […] If there is anyone who could be considered a “pop star” of R, it would be Hadley Wickham: the guy is responsible for ggplot2 and dplyr, which are some of the most popular R packages! But they are precisely packages that I hardly use… :( Let me explain. I have been an R user for many years (I did the math in my head as I write, and if I am not mistaken, now in 2018 it would be some 13 or 14 years!), so it has been quite a while since I learned how to solve (and teach) certain things. So far so good. It turns out that Hadley brought a … Read more →

# Lecture Notes voor Business Process Management (3637)

## by B. Depaire

These are the lecture notes for the course Business Process Management […] This document contains the lecture notes for the course Business Process Management (3637), taught at Universiteit Hasselt. These lecture notes support the lectures and contain both a “bullet-point” summary of the main topics and a collection of sources for further study of the … Read more →

# ggplot2 介紹

## by 林茂廷老師

ggplot2 介紹 […] hypothes.is: https://hypothes.is/groups/eBBqEGde/minicourse-ggplot2 When pasting code on hypothes.is, please post it following the example below: ggplot2 cheatsheet Computing for the Social Sciences, U.Chicago. ggplot2 part of the … Read more →

# Brief introduction to Statistic

## by Daxue Consulting

Brief introduction to Statistic […] Many statistical quantities derived from data samples are found to follow the Chi-squared distribution. Hence we can use it to test whether a population fits a particular theoretical probability distribution. In this section, we consider a multinomial experiment with k outcomes that correspond to categories of a single qualitative variable. The results of such an experiment are summarized in a one-way table. The term one-way is used because only one variable is classified. Typically, we want to make inferences about the true proportions that occur in the … Read more →

# Lösningar i R till vissa uppgifter från övningskompendierna (samt lite annat kul)

## by Erik Stenberg

Solutions to selected exercises in the course Statistik A4/A8 […] This document is for those of you who are taking the course Statistik A4/A8 and are curious about R. The content is meant to combine a little usefulness (solving exercises) with pleasure (learning some R). This document is not intended to serve as a comprehensive introduction to the R programming language. There are plenty of very well-written guides online that focus much more on how the language is structured. Fortunately, R is very easy to get started with, and not much understanding of the language itself is required to do simple calculations, … Read more →

# Lecture Notes voor Exploratieve en Descriptieve Data Analyse

## by B. Depaire

These are the lecture notes for the course Exploratieve en Descriptieve Data Analyse […] This book contains the lecture notes for the course “Exploratieve en Descriptieve Data Analyse” (1ste Ba Handelsingenieur/Handelsingenieur in de Beleidsinformatica) at Universiteit Hasselt. The idea of this document is to provide an accompanying text to support the slide decks used during the lectures. This text is structured in “bullet-point” fashion and helps recall the story told during the lecture. In addition, for each chapter … Read more →

# 统计学习方法 – 基于R的算法实现

## by lcrfromfzu@qq.com

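This is a minimal book created by using the bookdown package. The output format for this little book is bookdown::gitbook. […] The publication of this document relies on the bookdown package; thanks to its author! The algorithms underlying the R code in this document come from 统计学习方法 (by 李航); thanks to the author! The chapters of this document specifically include: … Read more →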

# 網頁入門

## by 林茂廷老師

Introduction to web pages […] Syntax reference: … Read more →

# useR! Machine Learning Tutorial

## by Erin LeDell

useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive. […] useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive This tutorial contains training modules for six popular supervised machine learning methods: Here are some practical, related topics we will cover for each algorithm: Instructions for how to install the necessary software for this tutorial are available here. Data for the tutorial can be downloaded… Certain algorithms don’t scale well when there are millions of features. For example, decision trees require computing some sort of metric (to determine the splits) on all … Read more →

# Noções de Inferência no R

## by Thalita do Bem Mattos

This handout is a support tool for the theoretical classes of ME319-Noções de Inferência. […] The goal of this handout is to present the inference concepts taught in class in the course ME319 - Noções de Inferência in a practical and intuitive way, using computational resources such as the R software. ME319 - Noções de Inferência - IMECC/UNICAMP After installing R, we will install RStudio. RStudio is a new interface for R with several features that make it easier to use … Read more →

# Novel methods for dose–response meta-analysis

## by Alessio Crippa

Novel methods for dose–response meta-analysis […] A single experiment can hardly provide a definitive answer to a scientific question. Science is oftentimes referred to as a cumulative process where results from many studies, aiming to address a common question of interest, contribute to create and update the scientific evidence. In the cumulative paradigm, meta-analysis is the statistical methodology to combine and compare the current evidence in the field. This process lies at the heart of the concept of evidence-based medicine and plays a major role in policy and decision making. … Read more →

# An R Platform for Social Scientists

## by Burak AYDIN, James ALGINA, Walter LEITE, Hakan ATILGAN

R book for social scientists […] The online version of this platform is licensed under the CC0 by Burak AYDIN. We aim to create a platform for applied social scientists in which we can demonstrate basic statistical procedures using R (R Core Team 2016b) and real data. We prefer to call this material a platform given that (a) it is open for contribution, (b) it will have dynamic content and (c) it can serve as a mainboard for Plug-ins and Add-ons. This R material is created with Bookdown (Xie 2016), an advanced system constructed on R Markdown (Allaire et al. 2016) and the R … Read more →

# Sosyal Bilimler R Platformu

## by Burak AYDIN, James ALGINA, Walter LEITE, Hakan ATILGAN

R Platform for the Social Sciences […] The rights of this platform are reserved, CC0 by Burak AYDIN. This material was prepared in English and translated into Turkish. This platform was created for researchers who work in the social sciences and are interested in the application stage of quantitative data analysis rather than the theory. All statistical procedures were carried out with R (R Core Team 2016b), and care was taken to use real data. There are three reasons this material is called a platform: (a) it is open to contribution, (b) it has dynamic content, (c) it can be used like a computer mainboard, created with R … Read more →

# Numerical Analysis: Notes

## by Brynjólfur Gauti Jónsson

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a collection of my notes and algorithms from a course on Numerical Analysis at the University of Iceland. The book used in the course was Numerical Analysis by Timothy … Read more →

# The Final War

## by Beckett Stephens

My book about adventures in a video game named Minecraft. […] This book is about my adventures in a video game named … Read more →

# YaRrr! The Pirate’s Guide to R

## by Nathaniel D. Phillips

An introductory book to R written by, and for, R pirates […] The purpose of this book is to help you learn R from the … Read more →

# Lab notes for Statistics for Social Sciences II: Multivariate Techniques

## by Eduardo García Portugués

Lab notes for Statistics for Social Sciences II: Multivariate Techniques […] Welcome to the lab notes for Statistics for Social Sciences II: Multivariate Techniques. Throughout these notes we will see how to effectively implement the statistical methods presented in the lectures. The exposition we will follow is based on learning by analyzing datasets and real-case studies, always with the help of statistical software. While doing so, we will illustrate the key insights of some multivariate techniques and the adequate use of advanced statistical software. Be advised that these notes are neither … Read more →

# Estimación en dominios pequeños

## by Grupo SAE - USTA

This book presents an introduction to small area estimation with the R software. […] This book presents an introduction to small area estimation with the R software. xxxx vv zz second commit in Github This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation \(a^2 + b^2 = c^2\). For now, you have to install the development versions of bookdown from Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need to … Read more →

# 高中生的未來事件簿

## by tpemartin

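A Casebook of the Future for High School Students […] University is an important period between entering society and learning in the classroom. Many of the most stirring stories happen at this time, unrestrained and vivid, a little more mature than high school yet still seemingly tender next to adults. Although everyone enters university for a different purpose, all are ultimately on the road of pursuing truth. Whether your future character will be as exuberant as the blazing sun or as still as a cold pool, whether your thinking will be far-sighted or short-sighted, whether your convictions will be sincere and honest or shameless: this is shaped first by the education your family gave you and then by the training you give yourself at university. We take you to the core, “making yourself a better person than you were yesterday”, from various viewpoints, hoping to be of some help. We are only fireflies on a wild path, unable to light the road farther ahead; it is your heart, seeking knowledge, truth, and goodness, that is the bright moonlight illuminating the whole plain. How could fireflies compete with the sun and the moon? Welcome to this website; we sincerely invite you to read our work. … Read more →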

# Financial Engineering Analytics: A Practice Manual Using R

## by William G. Foote

This book explores the fundamentals of financial analytics using R and various topics from finance. […] Science alone of all the subjects contains within itself the lesson of the danger of belief in the infallibility of the greatest teachers of the preceding generation. - Richard Feynman This book is designed to provide students, analysts, and practitioners (the collective “we” and “us”) with approaches to analyze various types of financial data sets, and to make meaningful decisions based on statistics obtained from the data. The book covers various areas in the financial industry, from … Read more →

# R och Demoskop

## by Filip Wastberg

This is a document for getting started with R at Demoskop […] This is a document about R at Demoskop. R is a programming language for statistical analysis. At Demoskop, R is mainly used as a complement to the programming we do in SAS and SPSS. This document is adapted to our ways of working at Demoskop. No general prior knowledge is needed. However, we recommend that after completing the installation you take this course on datacamp.com. It is a simple introduction to R and some packages that make the workflow easier. Datacamp is a good website for learning … Read more →

# Github 介紹

## by 林茂廷老師

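Introduction to Github […] Here we explain things in terms that non-programmers can understand, so the explanations do not match the original, complete definitions. Github.com: a [cloud space] where you can store backups. Github Desktop: a [backup helper] installed on your computer; through it you can choose to back up the contents of a folder on your own computer, or go further and back them up to the Github.com cloud space. We first assume that you have already registered an account on Github.com (hereafter .com), installed Github Desktop (hereafter Desktop) on your computer, and set up Desktop so that it can connect to your .com account. … Read more →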

# Functional programming and unit testing for data munging with R

## by Bruno Rodrigues

This book is an introduction to functional programming and unit testing with the R programming language, for the purpose of data munging […] This book is still being written, some chapters are not finished yet, and there might be (there are) some typos. Don’t hesitate to write to me if you notice something weird. You can purchase a digital copy of this book at leanpub. The version on Leanpub will not always be up-to-date, I only update it when I make very big changes (new chapters, etc). But once this book is finished, both versions are going to be the same. This book serves to show how … Read more →

# R Markdown 介紹

## by 林茂廷老師

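Introduction to dplyr […] A standardized plain-text syntax used to express rich typesetting. Wiki example. By itself it does not produce word, html, or pdf files; instead, other applications such as pandoc are used to generate the relevant document formats. … Read more →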

# dplyr 介紹

## by 林茂廷老師

Introduction to dplyr […] … Read more →

# IRT (GMMSGE01): Parametric IRT (dichotomous data)

## by Jorge N. Tendeiro

IRT (GMMSGE01): Parametric IRT (dichotomous data) […] Parametric item response theory (IRT) provides a theoretical framework that allows modeling the relationship \[\text{item} \longleftrightarrow \text{person}\] by means of a mathematical function: \[P(X_i = c|\theta_n) = f(\theta_n)\] \(X_i\) is the random variable denoting the answer to item \(i\), with discrete response categories; \(\theta_n=\) \(n^\text{th}\) person’s trait parameter. This is the item response function (IRF). The IRF is therefore a function relating the latent trait to the probability of answering the item correctly. … Read more →

# Economic Forum

## by Mau-Ting Lin

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. … Read more →

# Shiny (I)

## by 林茂廷老師

Shiny (I) … Read more →

# Simulation And The James-Stein Estimator In R

## by Alex Hallam

Simple Simulation and the James-Stein Estimator […] This is the website for “Simulation And The James-Stein Estimator In R”. This technical document is short, covering some common ways to generate data and exploring the James-Stein Estimator. This will teach you how to run simulations to observe the properties of the James-Stein Estimator in R, specifically using the tidyverse: You’ll learn how to generate data to prove theoretical results. In the computer age of statistics the data scientist has the power of machines to run simulations for testing a method before putting a method into … Read more →

# Data visualization

## by Mau-Ting Lin

This is a collection of data visualization handouts from Macroeconomics. … Read more →

# Muestreo y análisis de estudios educacionales con R

## by Andrés Gutiérrez

This is the repository of the book Diseño y análisis de estudios educacionales. […] The computational formulas required to estimate the variance of descriptive statistics such as the sample mean are available for some complex designs that incorporate elements such as stratification and cluster sampling. However, for more complex analytical statistics, such as correlation coefficients and regression coefficients, specific formulas are not easy to find for sampling designs that depart from simple random sampling. … Read more →

# Selected Solutions to R4DS Exercises

## by Chunji Wang

This book provides selected solutions to the exercises in the wonderful book R for Data Science by Hadley Wickham. […] This is the website for “Selected Solutions to R4DS Exercises”. This is a joint adventure between Chunji Wang, Ron, Luna, Zhiyin, Chengcheng…. We started the “R4DS Study Club” on Sep 22nd, 2017; If you want to join us, please contact us! The chapter labels in this book are the same as in the original R4DS book; go to the corresponding chapter for solutions. You might need to read the beginning of the chapter to load some packages or create some variables that are … Read more →

# R bookdownplus Textbook

## by Peng Zhao

A tutorial to R bookdownplus, an extension of the R bookdown package. This book helps you write academic journal articles, guitar books, chemical equations, mails, calendars, and diaries, on the basis of R bookdown. […] A book titled R bookdownplus Textbook is surely talking about ‘bookdownplus’ (Zhao 2017b), but let’s start with ‘bookdown’ (Xie 2016). ‘bookdown’ is a software package for writing books or documents based on R language (R Core Team 2016) and Markdown syntax. It is something like Microsoft Word, but more elegant, more powerful, and … Read more →

# Guide til klinikophold

## by Søren O’Neill

This page serves as guidance and inspiration for supervisors and students in the pre-graduate clinical education in clinical biomechanics. […] On these pages you will find guidance for the pre-graduate clinical placements for chiropractic students (stud. kand.manu) The pre-graduate clinical placements are divided as illustrated above; with a pre-clinical course at SDU followed by a ‘clinic-entrance’ exam, a longer placement at a spine centre, and 2 shorter placements in other settings. The text is divided into two main sections – one written primarily with the supervisors in mind and one for students. Both … Read more →

# Applications of Multivariate Analysis in Business

## by Ed Anderson

This document describes the concept of Mass Customisation as it applies to Business Analytics and provides case study implementations of R Studio […] It has been great being part of the Analytical Community the last few years. The excitement is everywhere about “big-data”, “data-science”, “MOOCs”. The talent being attracted into Analytics is awe inspiring. One current trend is ‘a shift from a desire to work for bigger name brand companies like Facebook or Google, to more mission-driven organizations attempting to make an impact on society. Whether it is curing cancer, conserving energy, … Read more →

# ABJ Syllabus

## by Associação Brasileira de Jurimetria

A track of papers we read and papers we collect to read in future. […] For your bookdown to work both on the web and in pdf, you should avoid using markers that depend on context. To make citations you should use (Weinstein 1997) or Weinstein (1997). This also works for packages (R Core Team 2017) or R Core Team (2017). To create a figure, it is preferable to use knitr's default print. The plot's label will be fig:label-do-chunk. You can reference it by doing 1.1. If you need to import an image from outside R, it is better that you do , despite what Yihui says. … Read more →

# Studieren und Forschen mit dem Internet

## by Peter Baumgartner, Sabine Payr

Work processes and tools of academic work. Abridged edition from 2001, but much of the content is still current. […] Studieren und Forschen mit dem Internet by Peter Baumgartner & Sabine Payr is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Permissions beyond the scope of this license can be obtained at http://peter.baumgartner.name/kontakt. Studieren und Forschen mit dem Internet was published by StudienVerlag in 2001 and is now out of print. Remaining copies can still be bought second-hand via Amazon … Read more →

# Mastering Software Development in R

## by Roger D. Peng, Sean Kross, and Brooke Anderson

The book covers R software development for building data science tools. As the field of data science evolves, it has become clear that software development skills are essential for producing useful data science results and products. You will obtain rigorous training in the R language, including the skills for handling complex data, building R packages and developing custom data visualizations. You will learn modern software development practices to build tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers. Read more →

# Course Notes for IS 6489, Statistics and Predictive Analytics

## by Jeff Webb

Course notes for IS 6489. […] These are the course notes for IS 6489, Statistics and Predictive Analytics, offered through the Information Systems (IS) department in the University of Utah’s David Eccles School of Business. This is an exciting time for data analysis! The field has undergone a revolution in the last 15 years with increases in computing power and the availability of “big data” from web-based systems of data collection. “Data science” is the umbrella term that describes the result of this revolution—a new discipline at the intersection of many traditional fields such as … Read more →

# Lokal lagring og bruk av sensitive data

## by Are Edvardsen, SKDE

Guide to installing and using VeraCrypt for secure storage and deletion of data at Senter for klinisk dokumentasjon og evaluering (SKDE), Helse Nord RHF. […] Analysis of sensitive and time-limited data is part of SKDE's practical tasks. Such data will typically be accessible only to a limited and defined group of users, and it must be possible to delete them effectively when their validity period expires. This poses some particular challenges when users must also be able to work efficiently and share such data among themselves. Typical of analysis work is also that … Read more →

# Probability and Statistics

## by Rob Carroll

These are the lecture notes for POS 5737, the introductory probability and statistics class in the graduate program in political science at Florida State University. […] These are the notes for POS 5737, taught in the Department of Political Science at Florida State University. They freely borrow from several well-known textbooks, including those by Wackerly, Mendenhall, and Scheaffer (2008), DeGroot and Schervish (2012), and Casella and Berger (2002). They also borrow from my own notes as a graduate student when I was taught by Kevin Clarke. Kevin was kind enough to provide his own old … Read more →

# R tips: 16 HOWTO’s with examples for data analysts

## by Lingyun Zhang

R tips: 16 HOWTO’s with examples for data analysts […] … Read more →

# ModernDive

## by Chester Ismay and Albert Y. Kim STARRING FRANK MCGRADE

An open-source and fully-reproducible electronic textbook bridging the gap between traditional introductory statistics and data science courses. […] Help! I’m new to R and RStudio and I need to learn about them! However, I’m completely new to coding! What do I do? If you’re asking yourself this question, then you’ve come to the right place! Start with our Introduction for Students. This is version 0.2.0 of ModernDive published on August 02, 2017. For previous versions of ModernDive, see Section 1.4. This book assumes no prerequisites: no algebra, no calculus, and no prior programming/coding … Read more →

# R Studio: A 3D Printer for Business Analytics

## by Ed Anderson

This document describes the concept of Mass Customisation as it applies to Business Analytics and provides case study implementations of R Studio […] Good Morning! How are you doing? It’s been great being part of the Analytical Community the last few years hasn’t it? The excitement is everywhere about “big-data”, “data-science”, “MOOCs”. I have been blown away by the talent being attracted into Analytics. One current trend is ‘a shift from a desire to work for bigger name brand companies like Facebook or Google, to more mission-driven organizations attempting to make an impact on society. … Read more →

# Data Science in Educational Research

## by Joshua M. Rosenberg

This is an introduction and tutorial for data science in educational research. … Read more →

# Papa’s Three Laws

## by 大鹏&朋友

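This is a selection from a papa’s diary originally posted on my blog. It tells a family’s stories of two children. This book is being updated. […] We have two kids at home. The older one is a boy, born in Beijing, called 京生; the younger one is also a boy, born in Germany, called 德生. This book tells the parenting and family stories of me and my friends. … Read more →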

# Data Science and Visualizations with R

## by Jonathan Wong

Data Science and Visualizations with R […] This is a course on the use of tidyverse packages. tidyverse provides a complete suite of modern data-handling tools. It is an essential toolbox for any data scientist using R. The tidyverse package is designed to be easy to install. This course will dive into using tidyverse. It will assume you have already installed R and RStudio and have some familiarity with how to use RStudio. This book will use the nycflights13 dataset. This package contains information about all flights that departed from NYC in 2013: 336,776 flights with 16 variables. To … Read more →

# R語言套件之道

## by Cheng-Chung Li

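There are several reasons for writing this book. Besides organizing what I have learned about the R language, I also hope to help more people package the R functions they have written into R packages that comply with CRAN's rules and put them online for everyone to share. There are in fact many R experts in this country, but few people know how to wrap finished functions into a package, and there is not much local reference material. As a result, the functions can at most be shared within a company, which is a real pity. I also hope that material like this will let Taiwan's experts be seen by the world and contribute together to the R language. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation \(a^2 + b^2 = c^2\). For now, you have to install the development versions of bookdown from Github: Remember each Rmd file contains one … Read more →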

# (Very) basic steps to weight a survey sample

## by Josep Espasa Reig

(Very) basic steps to weight a survey sample […] This is an introductory guide to survey weighting. It provides a step-by-step walkthrough of the main procedures and explains the statistical principles behind them. The guide includes R code to implement all stages of survey weighting and reproduces the weighting procedures of the 7th European Social Survey in the UK. This text avoids technical notation and language and is targeted to social scientists with a basic level of statistics and probability theory. Readers without knowledge of R should be able to benefit from this text as it … Read more →

# The Unix Workbench

## by Sean Kross

The Unix Workbench […] Cover Image: A Goldsmith in his Shop by Petrus Christus This work by Sean Kross is licensed CC0. Zero rights … Read more →

# Gopnik Guide to Biology

## by Paulius Alaburda

An attempt to create an easy biology e-book. […] The internet has so far had little influence on Lithuania's educational tools. Electronic booklets have appeared in place of paper booklets, and apps have appeared for preparing for the state exams. But these tools are tied to control structures for tracking a pupil's progress and assessing whether they have prepared correctly for the exam. Real education begins not with grades or an assessment after 12 years, but with the primary question: why? The initial surprise that our surroundings do not match our internal model of reality pushes us to take action to find out where we went wrong and … Read more →

# Notes

## by Miao YU

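These are notes from yufree […] The notes here mainly come from notes on open courses and reading notes on related textbooks. The topics are relatively scattered, but this knowledge should be part of the basic skills of today's researchers. First, researchers need some grounding in mathematics and statistics, the most basic tool disciplines. Calculus, linear algebra, and numerical methods are essential mathematical tools; as for statistical tools, one should at least understand how to carry out statistical inference and prediction. The rest depends on the application; number theory, for example, is fundamental to cryptography. Next come programming skills. In programming, one should first become familiar with programming ways of thinking, such as recursion, iteration, and conditional statements, that is, know how the machine operates. Second is mastering a high-level language such as R, python, or matlab, so that you can quickly implement your own ideas. After that comes model thinking: knowing how to abstract a real problem into a conceptual problem, a statistical problem, or a simulation … Read more →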

# Notes on R for AML100

## by Jordan T. Bates

Notes on R for the course AML100 at Arizona State University. […] These notes introduce the basics of the programming language R as needed for the course AML100. Notes on RStudio and R Markdown are included in … Read more →

# Underlagsrapport för En ännu bättre strålbehandling avseende incidens och prevalens av cancer i Västra Sjukvårdsregionen 2016-2030

## by Erik Bülow

Prediction of the future occurrence of cancer in Västra Sjukvårdsregionen. […] The report is presented in three formats, all with the same text and image content but with somewhat different technical solutions. If you are reading the HTML version of this report, you can reach the other formats via the download icon in the page header (see figure … Read more →

# 液体活检口袋书

## by Dr.Thunder, Ming, Youcai

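Liquid biopsy pocket book (in Chinese), written by Bioinformatics engineers. […] 海普洛斯 is launching the [液体活检口袋书] column, giving a systematic and comprehensive introduction to liquid biopsy. Updated every Wednesday, it introduces everything about liquid biopsy. … Read more →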

# Detecting collusion in government procurement contracts

## by Manuel Aragonés, Thalía Guerra, Roberto Sánchez and Mónica Zamudio

This publication is the result of five months of work for our Data Product Architecture class project. […] Since 2002, the Mexican Federal government handles most of its procurement biddings through a transactional platform called Compranet. Even though most of the information in the platform is public, authorities and organizations dedicated to fight corruption do not have a technical framework to better allocate their resources into cases. Our project consisted in developing an interactive dashboard for investigators to track particular contracts and to filter out low-risk … Read more →

# GuitaR Bookdown

## by 大鹏

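This is a collection of my favorite songs with guitar chords, produced by bookdown. […] The truest dream is to use R's bookdown to bring R code, plots, data analysis, and guitar tabs together. What? What is the use of bringing them together? Uh… let me clear my head and think… Crossing the hill ahead, only to find no one waiting… One day the wish will finally be fulfilled, running with you in search of eternity \[\int_0^\infty e^{-x^2} dx=\frac{\sqrt{\pi}}{2}\] The guitar tabs in this book cannot be seen on the web page; they are only visible if you click to download the pdf. … Read more →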

# Föll í R - Dæmi

## by Eyþór Björnsson

Functions in R - Examples […] Here are examples of using functions that I have written and that can be found on GitHub. These are mainly functions that save a lot of time when setting up common tables for scientific articles (in the field of medicine), but they are also helpful for getting a sense of the correlation between different variables in the dataset. This page is made with bookdown. It is a great package that weaves together R markdown files and assembles them into an accessible html book format. All the examples use the ‘diabetes’ dataset, which is available from http://biostat.mc.vanderbilt.edu/wiki/Main/DataSets. The data consist of 19 … Read more →

# Egils saga Skalla-Grímssonar

## by eythorbj

Egils saga Skalla-Grímssonar […] The text of Egils saga was copied from the website of The Icelandic Saga Database (retrieved 15 May 2017) and prepared for publication here with R markdown and the bookdown package in R. Eyþór Björnsson, 15 May … Read more →

# Literature thesis: Building a framework for retrieving information on multispecies interactions from published literature

## by Gabriel Muñoz

Literature thesis: Building a framework for retrieving information on multispecies interactions from published literature […] The generation of new global hypotheses, aimed at understanding our current global biodiversity crisis, requires a large amount of information. Our knowledge in Ecology is principally contained in the form of published articles. This global body of literature holds a significant amount of primary data on species distributions and interactions across a large geographical and temporal scale. In this literature review, I explore the use of different computational tools … Read more →

# The Art of Data Science

## by Roger D. Peng and Elizabeth Matsui

The book covers R software development for building data science tools. As the field of data science evolves, it has become clear that software development skills are essential for producing useful data science results and products. You will obtain rigorous training in the R language, including the skills for handling complex data, building R packages and developing custom data visualizations. You will learn modern software development practices to build tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers. Read more →

# An approach to identify the sources of low-carbon growth for Europe

## by Georg Zachmann, Bruegel gz@bruegel.org, Robert Kalcik, Bruegel robert.kalcik@bruegel.org

This website serves to illustrate the findings of the policy contribution ‘An approach to identify the sources of low-carbon growth for Europe’ and allows a deeper dive into the underlying data. […] This website serves to illustrate the findings of the policy contribution “An approach to identify the sources of low-carbon growth for Europe” (Zachmann 2016) and allows a deeper dive into the underlying data. The website is focused on presenting figures and deliberately only offers short descriptions and interpretations. The research underlying this report has been financially supported by the … Read more →

# Notas sobre Estimación Puntual

## by Peter Olejua

Covers the topic of point estimation for the course Métodos en Bioestadística I, part of the Maestría en Bioestadística at Universidad Javeriana […] The following pages briefly develop the topic of point estimation. They are part of an assessment for the course Métodos en Bioestadística I, part of the Maestría en Bioestadística at Universidad Javeriana. This work can be used as an introduction, as class notes, or as the start of a collaboration on a broader and more complete text on point estimation. Any criticism, contribution and/or … Read more →

# An approach to identify the sources of low-carbon growth for Europe

## by Georg Zachmann

Draft website for the European Climate Foundation […] This website serves to illustrate the findings of the policy contribution “An approach to identify the sources of low-carbon growth for Europe” (Zachmann 2016) and allows a deeper dive into the underlying data. The website is focused on presenting figures and deliberately only offers curt descriptions/interpretations. It is currently structured into five chapters but we plan to extend it when further steps of our analysis become available. The research underlying this report has been financially supported by the European Climate … Read more →

# Advances on the analysis on connectivity of Raphia taedigera palm swamps for Central America

## by Gabriel Muñoz

Advances on the analysis on connectivity of Raphia taedigera palm swamps for Central America … Read more →

# Data lunch 2feb: The use of Bookdown to write documents and reports

## by Gabriel Muñoz

Data lunch 2feb: The use of Bookdown to write documents and reports […] Make sure you have installed the latest version of R and the Preview Release of RStudio. The following packages should be installed. If you have them already, make sure they are updated. The most up-to-date versions are the “in development” versions from GitHub. Do you have Pandoc installed? RStudio should come along with Pandoc. And LaTeX? (if you want to have PDF outputs as well) Note that PDF does not allow interactive plots. If you do not have LaTeX installed: Mac OS X –> MacTeX (http://www.tug.org/mactex/) Linux … Read more →

# 기초통계 개념정리

## by 김진섭

This is a basic statistics book written by JSKIM. […] This is a basic statistics book written by Jinseob … Read more →

# Revealed comparative advantage and network centrality

## by Sergej Kaiser

Revealed comparative advantage and network centrality … Read more →

# A Minimal Book Example

## by Yihui Xie (Daniel Kim translated it into Korean)

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). For now, you have to install the development versions of bookdown from Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need to install XeLaTeX. 이 예제를 PDF로 컴파일하려면, XeLaTeX을 … Read more →

# R aplicado à Biologia: uma introdução descomplicada e divertida!

## by Marcos Vital, do Laboratório de Ecologia Quantitativa da UFAL

This is the live book of the blog Cantinho do R […] This material was built with the help of many people who believe in LEQ and in Open Science. Thank you very much! For more material, visit Cantinho do R. A preface to the new Cantinho do R handout. Hooray! After a long delay (at least for those who have followed us since the beginning), here is our new Cantinho do R handout! :D If you are a newcomer, I think I have to start by explaining here what this material is, what it is for, and where it came from, right? That is what this first chapter is about. But don't worry, … Read more →

# Do not use averages with Likert scale data

## by Dwight Barry

This is a short overview of why averages don’t work well for evaluating Likert scale or other ordinal-scale data, and what to do instead, with examples using R. While the examples are focused on healthcare surveys, the lessons apply to any use of ordinal scale data. Note: all of the data in this document is fake, created specifically to illustrate particular points. Contact/Twitter: @healthstatsdude PDF version: Website: https://bookdown.org/Rmadillo/likert/ Corrections/Pull requests: https://github.com/Rmadillo/likert Cover image: Gustave Doré, 1863. Illustration 12 for Cervantes’s Don … Read more →

# Dengue Forecasting Project

## by Raghvendra Jain

This is a book that contains experiments and results about the predictions of dengue outbreaks in Thailand. […] This is a sample book written in Markdown. For now, you have to install the development versions of bookdown from GitHub: … Read more →

# Efficient R programming

## by Colin Gillespie, Robin Lovelace

Efficient R Programming is about increasing the amount of work you can do with R in a given amount of time. It’s about both computational and programmer efficiency. […] This is the online version of the O’Reilly book: Efficient R programming. Pull requests and general comments are welcome. Colin Gillespie is Senior lecturer (Associate professor) at Newcastle University, UK. His research interests are high performance statistical computing and Bayesian statistics. He is regularly employed as a consultant by Jumping Rivers and has been teaching R since 2005 at a variety of levels, ranging … Read more →

# Interactive Data Visualization (2nd Day)

## by Paul C. Bauer & Richard Traunmüller

Script developed for a workshop at the CUSO doctoral school on the 4th and 5th November 2016. […] This document serves as slides and script for the second day of the workshop Data Visualization taught by Paul C. Bauer and Richard Traunmüller for the Programme doctoral en science politique (PDSPO) (Bern, 4-5 of November 2016). The present material is licensed under a Creative Commons Attribution-ShareAlike License 3.0. Regarding further use of this material contact Paul. Some of the material is inspired by the official shiny tutorial and Plotly for R by Carson Sievert. For potential future … Read more →

# ggplot2逆引き集

## by @kazutan

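This is a ggplot2 reverse-lookup collection. […] This is a collection of the ggplot2 reverse-lookup articles published on Qiita. At present it gathers the 12 articles written by @kazutan. If you have any feedback, please open an issue on the GitHub repository below or contact @kazutan on Twitter. … Read more →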

# R Powered Web Applications with Shiny

## by Zev Ross (with Andrew Clark)

R Powered Web Applications with Shiny […] This is a book version, transcribed by Andrew Clark using RStudio’s bookdown package, of an extensive blog post by Zev Ross. The book version has the advantage of being available in several formats, more easily updated and downloadable. However, for an interactive version refer to the above-mentioned blog … Read more →

# Premier League Annual

## by Andrew Clark

Premier League Annual […] This is an ‘on the fly’ annual based on the 2016/17 Premier League season, updated weekly with charts, tables, highlight videos and trivia related to the games played. Each chapter features static visualizations relevant to the games that week. Greatly extended, fully-interactive and constantly updated versions can be found on the accompanying dashboard site. Additional data is available at the Premier League Web site. Most of the underlying data is unofficial, unguaranteed error-free and available for a million dollars. There is also likely to be use of James … Read more →

# Handling Strings with R

## by Gaston Sanchez

This book aims to provide a panoramic perspective of the wide array of string manipulations that you can perform with R. If you are new to R, or lack experience working with character data, this book will help you get started with the basics of handling strings. Likewise, if you are already familiar with R, you will find material that shows you how to do more advanced string and text processing operations. Read more →

# Spark Social Science Manual

## by Research Programming, The Urban Institute

Spark Social Science Manual […] Let the sample mean, $\hat{\mu}$, be the parameter estimate for our mean parameter $\mu$ and the null hypothesis of the t-test be $H_0$: $\mu = 0$. The test statistic is given by $\hat{\mu} / (\hat{\sigma} / \sqrt{n})$. Remember that the p-value is determined by the test statistic and the t-distribution with $(n - 2)$ degrees of freedom in this case. By the Central Limit Theorem, $\sqrt{n}(\hat{\mu}-\mu) \rightarrow N(0,\sigma^2)$ as $n \rightarrow \infty$, or written differently as $\hat{\mu} \rightarrow \mu + \frac{\sigma}{\sqrt{n}}N(0,1)$ … Read more →
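The test statistic quoted in this excerpt can be sketched in a few lines of Python. This is an illustrative sketch only and does not come from the manual itself. The function name `one_sample_t_statistic` and the sample values are hypothetical, and the sketch assumes that the estimate of the standard deviation is the usual sample standard deviation with an n - 1 denominator.

```python
import math
import statistics


def one_sample_t_statistic(sample):
    """Compute mu_hat / (sigma_hat / sqrt(n)) for the null hypothesis mu = 0."""
    n = len(sample)
    mu_hat = statistics.fmean(sample)      # sample mean
    sigma_hat = statistics.stdev(sample)   # assumption: n - 1 denominator
    return mu_hat / (sigma_hat / math.sqrt(n))


# Hypothetical data, chosen only to make the arithmetic easy to follow.
sample = [1.0, 2.0, 3.0, 4.0, 5.0]
t_stat = one_sample_t_statistic(sample)
print(round(t_stat, 4))
```

The sketch stops at the test statistic. Turning it into a p-value requires the t-distribution with the degrees of freedom appropriate to the model, which the excerpt discusses separately.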

# Multivariate Analysis with Optimal Scaling

## by Jan de Leeuw, Patrick Mair, Patrick Groenen

In 1980 members of the Department of Data Theory at the University of Leiden taught a post-doctoral course in Nonlinear Multivariate Analysis. The course content was sort-of-published, in Dutch, as Gifi (1980). The course was repeated in 1981, and this time the sort-of-published version (Gifi (1981)) was in English. The preface gives some details about the author. The text is the joint product of the members of the Department of Data Theory of the Faculty of Social Sciences, University of Leiden. ‘Albert Gifi’ is their ‘nom de plume’. The portrait, however, of Albert Gifi shown here, is that … Read more →

# Econ 215 Notes

## by Salfo Bikienga

Lecture notes for my introduction to statistics class at University of Nebraska-Lincoln. […] This is supposed to be your first course in statistics. So the goal is to give you an overview of what statistics is, why it is a powerful thing to know, how you can use it to make informed decisions or understand “numbers speak” people throw around in the news. At the end of this class, I hope: 1- You understand the importance of statistics; 2- You can better appreciate the numbers you get from the news; 3- You can perform your own analysis to inform yourself, and your collaborators. The explosion … Read more →

# Exploratory Data Analysis with R

## by Roger D. Peng

This book covers the essential exploratory techniques for summarizing data with R. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data you have. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing informative data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data. Read more →

# Getting used to R, RStudio, and R Markdown

## by Chester Ismay

An introduction to using R, RStudio, and R Markdown for new users […] In the HTML version of this book, you can also download the PDF version of the book by clicking on the PDF button in the top toolbar of the page. HTML is the preferred format but the PDF format may be preferred for some readers. Links to the different GIFs directly found in the HTML version are provided in the PDF version. This resource is designed to provide new users to R, RStudio, and R Markdown with the introductory steps needed to begin their own reproducible research. A review of many of the common R errors … Read more →

# Руководство по data.table

## by Андрей Огурцов

A guide to the data.table package: translations of the vignettes and reference information. […] Introduction. This guide contains translations of all the data.table package vignettes. All but the last were translated from the June 2015 versions; the last was translated from the April 2016 version. The translations will be kept up to date, and there are plans to add other materials. … Read more →

# Principles of Econometrics with R

## by Constantin Colonescu

This is a beginner’s guide to applied econometrics using the free statistics software R. […] … Read more →

# Chess Encounters

## by Andrew Clark and Joshua Kunst

Chess Encounters […] … Read more →

# Notes and Codes while Learning R

## by Scott Ming

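These are Mingsheng's notes on learning R. […] Bookdown is a great writing tool. Here I record some notes from my own study of R and data science; if you find any mistakes, please point them out. My GitHub address: https://github.com/scottming … Read more →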

# useR2016 Conference Videos

## by Andrew Clark

Chart, interactive table and a selection of videos from the useR2016 conference […] This acts as a repository for some of my favourite video talks from the recent useR2016 conference along with the ability to view any of the offerings via a clickable table. It is probably not the most effective of presentations but is a trial run for creating and deploying interactive books to bookdown.org. Andrew Clark is an independent R developer based in North Vancouver. He has for many years supplied statistical sports data on the web but with the interactive opportunities arising from the shiny framework … Read more →

# Scalable Machine Learning and Data Science with Microsoft R Server and Spark

## by Ali Zaidi, Machine Learning and Data Science, Microsoft

These are (tentatively) rough notes showcasing some tips on conducting large scale data analysis with R, Spark, and Microsoft R Server. The focus is primarily on machine learning with the Azure HDInsight platform, but the notes also review other in-memory, large-scale data analysis platforms, such as R Services with SQL Server 2016, and discuss how to use BI tools such as PowerBI and Shiny for dynamic reporting and report generation. Read more →

# Shiny Tutorial

## by Weicheng Zhu

This is a shiny tutorial. […] Some basic knowledge about the R language is required. It would be helpful if you have some basic knowledge about HTML, CSS and JavaScript, but they are not … Read more →

# Backtesting Strategies with R

## by Tim Trice

Backtesting strategies with R […] This book is designed to not only produce statistics on many of the most common technical patterns in the stock market, but to show actual trades in such scenarios. Test a strategy; reject if results are not promising. Apply a range of parameters to strategies for optimization. Attempt to kill any strategy that looks promising. Let me explain that last one a bit. Just because you may find a strategy that seems to outperform the market, have good profit and low drawdown, this doesn’t mean you’ve found a strategy to put to work. On the contrary, you must work to … Read more →

# Praktiskā biometrija

## by Didzis Elferts

Examples of working with the program R to solve statistical problems in biology. […] Praktiskā biometrija. This book is my attempt to give biologists practical advice on carrying out statistical analyses, in a fairly accessible form and with a minimum of theory. Since the emphasis is on the word "practical", most of the book consists of examples of how to perform each of the statistical tests covered. For a broader theoretical grounding, the works of other authors will be useful. Undeniably, the most serious work on biometry in Latvian that must be mentioned is the book by Liepa (1974); in English it would be one of … Read more →

# A Minimal Book Example

## by Yihui Xie

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation $a^2 + b^2 = c^2$. For now, you have to install the development versions of bookdown from GitHub: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading … Read more →

# Block Relaxation Methods in Statistics

## by Jan de Leeuw

The book discusses block relaxation, alternating least squares, augmentation, and majorization algorithms to minimize loss functions, with applications in statistics, multivariate analysis, and multidimensional scaling. […] Many recent algorithms in computational statistics are variations on a common theme. In this book we discuss four such classes of algorithms. Or, more precisely, we discuss a single large class of algorithms, and we show how various well-known classes of statistical algorithms fit into this common framework. The types of algorithms we consider are, in logical order, … Read more →

# APL in R

## by Jan de Leeuw, Masanao Yajima

R versions of the array manipulation functions of APL are presented. We do not translate the system functions or other parts of the runtime. Also, the current version does not have the nested arrays of APL2. […] APL was introduced by Iverson (1962). It is an array language, with many functions to manipulate multidimensional arrays. R also has multidimensional arrays, but not as many functions to work with them. In R there are no scalars, there are vectors of length one. For a vector x in R we have dim(x) equal to NULL and length(x) > 0. For an array, including a matrix, we have … Read more →

# Dialogues Concerning Natural Religion

## by David Hume

David Hume’s Dialogues Concerning Natural Religion […] Original text from Project Gutenberg. This eBook is for the use of anyone anywhere at no cost and with almost no restrictions whatsoever. You may copy it, give it away or re-use it under the terms of the Project Gutenberg License included with this eBook or online at http://www.gutenberg.net. This page is created using the Tufte package within Bookdown. Both were written primarily by Yihui Xie. Pamphilus to Hermippus. It has been remarked, my HERMIPPUS, that though the ancient philosophers conveyed most of their instruction in the … Read more →

# 16S rRNA analysis

## by R.Lappan

Documentation describing my analyses of 16S rRNA sequencing data. […] My name is Rachael Lappan, and I am a PhD candidate at the University of Western Australia. The core of my PhD work is the Perth Otitis Media Microbiome (biOMe) study, where I work on the upper respiratory tract microbiome in children with recurrent acute otitis media (middle ear infections). The first stage of this research involved characterising the microbiome (by 16S rRNA gene sequencing) on samples from children with ear infections compared with samples from seemingly resistant healthy controls. The paper can be … Read more →

# A Practical Extension of Introductory Statistics in Psychology using R

## by Ekarin E. Pongpipat, Giuseppe G. Miranda, & Matthew J. Kmiecik

This book aims to provide a practical extension of introductory statistics typically taught in psychology into the general linear model (GLM) using R. […] Typically, introductory univariate statistics courses in psychology cover the following inferential analyses (plus or minus a few more analyses): These conventions may be useful for quickly talking about a particular statistical analysis with others; however, thinking of these analyses as derivatives (or special cases) of the GLM (i.e., ordinary least squares [OLS] regression) lends itself to understanding more advanced statistical … Read more →

# Advanced R

## by Hadley Wickham

This is the website for the 2nd edition of “Advanced R”, a book in Chapman & Hall’s R Series. The book is designed primarily for R users who want to improve their programming skills and understanding of the language. It should also be useful for programmers coming to R from other languages, as it helps you to understand why R works the way it does. If you’re looking for the electronic version of the 1st edition, you can find it online at http://adv-r.had.co.nz/. This work, as a whole, is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The code … Read more →

# Advanced R Solutions

## by Malte Grosser, Henning Bumann & Hadley Wickham

Solutions to the Exercises from Hadley Wickham’s book ‘Advanced R’. […] This book offers solutions to the exercises from Hadley Wickham’s book Advanced R (Edition 2). It is work in progress and under active development. The 2nd edition of Advanced R is still being revised, but we hope to provide most of the answers in 2019. The solutions to the first edition of Advanced R can currently be found at https://advanced-r-solutions-ed1.netlify.com/. The code for this book can be found on GitHub. Your PRs and suggestions are very welcome. This work by Malte Grosser and Henning Bumann is licensed … Read more →

# Agile Data Science with R

## by Edwin Thoen

A workflow for doing data science in the R language, using Agile principles. […] When I was starting my career as a data scientist, I did not really have a workflow. Freshly out of statistics grad school I entered the arena of Dutch business, employed by a small consulting firm. Between the company, the potential clients and myself, no one knew what it meant to implement a statistical model or a machine learning method in the “real” world. But everybody was interested in this “Big Data” thing, so we quickly started to do consulting work without a clear idea what I was going to do. When we … Read more →

# Big Data and Social Science

## by Ian Foster, Rayid Ghani, Ron S. Jarmin, Frauke Kreuter and Julia Lane

Big Data and Social Science […] The class on which this book is based was created in response to a very real challenge: how to introduce new ideas and methodologies about economic and social measurement into a workplace focused on producing high-quality statistics. Since the first edition of this book came out we have been fortunate to train over 450 participants in the Applied Data Analytics classes, resulting in increased data analytics capacity, both in terms of human and technical resources. What we learned in delivering these classes greatly influenced the 2nd edition. We also added an … Read more →

# Climate Change Impact Assessment: A practical walk through

## by Conor I. Anderson and Karen L. Smith

A lab manual for students of Climate Change Impact Assessment […] This book is an open source document, hosted on the GitLab platform (project page), and published using GitLab Pages, where you are probably reading it now. The book is automatically updated and republished every time changes are committed to the project, using the GitLab multi runner CI engine, and a Docker image with a distribution of Miniconda, including Python 3 and R. The book is built using the bookdown package (Xie 2019) in R, and pandoc. Most of the code is executed in Python from within R using the reticulate package … Read more →

# ComplexHeatmap Complete Reference

## by Zuguang Gu

Complex heatmaps are efficient to visualize associations between different sources of data sets and reveal potential patterns. Here the ComplexHeatmap R package provides a highly flexible way to arrange multiple heatmaps and supports various annotation graphics. This book is the complete reference to the ComplexHeatmap package. […] This is the documentation of the ComplexHeatmap package. Examples in the book are generated under version 2.1.1. You can get a stable Bioconductor version from http://bioconductor.org/packages/release/bioc/html/ComplexHeatmap.html, but the most up-to-date version is … Read more →

# CookDown

## by Ellis Valentiner

A collection of recipes. […] This is a collection of recipes written in Bookdown. Feel free to … Read more →

# Data Science in Education Using R

## by Emily A. Bovee, Ryan A. Estrellado, Jesse Mostipak, Joshua M. Rosenberg, and Isabella C. Velásquez

Bookdown for ‘Data Science in Education Using R’ by Emily A. Bovee, Ryan A. Estrellado, Jesse Mostipak, Joshua M. Rosenberg, and Isabella C. Velásquez to be published by Routledge in 2020 […] Welcome to Data Science in Education Using R! Inspired by {bookdown}, this book is open source. Its contents are reproducible and publicly accessible for people worldwide. The online version of the book is hosted at datascienceineducation.com. There’s this story going around the internet about an eagle egg that hatches in a chicken farm. The eagle egg hatches near the chicken eggs. The local hens are … Read more →

# Data Science Live Book

## by Pablo Casas

An intuitive and practical approach to data analysis, data preparation and machine learning, suitable for all ages! […] This book is now available at Amazon. Check it out! 📗 🚀. Link to the black & white version, also available in full color. It can be shipped to over 100 countries. 🌎 The book will facilitate the understanding of common issues when data analysis and machine learning are done. Building a predictive model is as difficult as one line of R code: That’s it. But, data has its dirtiness in practice. We need to sculpt it, just like an artist does, to expose its information in order … Read more →

# Data Science Practice

## by Perry Stephenson

Course notes for 94692 Data Science Practice at the University of Technology, Sydney. […] This website forms the course notes for 94692 Data Science Practice which is an elective subject developed as part of the Master of Data Science and Innovation program at the University of Technology, Sydney. For more information about this subject see the Subject Information. For more information about the MDSI program see the MDSI Prospectus. Whilst these course materials have been produced specifically for MDSI students, they have been made available under a permissive license for the benefit of the … Read more →

# Data Visualization

## by Kieran Healy

A practical introduction. […] Published by Princeton University Press. Incomplete draft. This version: 2018-04-25. You should look at your data. Graphs and charts let you explore and learn about the structure of the information you collect. Good data visualizations also make it easier to communicate your ideas and findings to other people. Beyond that, producing effective plots from your own data is the best way to develop a good eye for reading and understanding graphs—good and bad—made by others, whether presented in research articles, business slide decks, public policy advocacy, or … Read more →

# Estadística Multivariada

## by María Teresa Ortiz, Felipe González

Multivariate statistics course, Master's in Data Science, ITAM 2015. […] Notes for the Estadística Multivariada (Multivariate Statistics) course of the master's program in Data Science at ITAM. The notes were developed in 2014 by Teresa Ortiz and Felipe González and updated in 2015; a second update is currently in progress. If you find errors or have suggestions about the material, proposed corrections via pull requests are appreciated. Notes: https://est-mult.netlify.com Email: teresa.ortiz.mancera@gmail.com GitHub: https://github.com/tereom/est-multivariada This work is … Read more →

# Forecasting: Principles and Practice

## by Rob J Hyndman and George Athanasopoulos

Welcome to our online textbook on forecasting. This textbook is intended to provide a comprehensive introduction to forecasting methods and to present enough information about each method for readers to be able to use them sensibly. We don’t attempt to give a thorough discussion of the theoretical details behind each method, although the references at the end of each chapter will fill in many of those details. The book is written for three audiences: (1) people finding themselves doing forecasting in business when they may not have had any formal training in the area; (2) undergraduate … Read more →

# Fundamentals of Data Visualization

## by Claus O. Wilke

A guide to making visualizations that accurately reflect the data, tell a story, and look professional. […] This is the website for the book “Fundamentals of Data Visualization,” published by O’Reilly Media, Inc. The website contains the complete author manuscript before final copy-editing and other quality control. If you would like to order an official hardcopy or ebook, you can do so at various resellers, including Amazon, Barnes and Noble, Google Play, or Powells. The book is meant as a guide to making visualizations that accurately reflect the data, tell a story, and look professional. … Read more →

# Hands-On Programming with R

## by Garrett Grolemund

This book will teach you how to program in R, with hands-on examples. I wrote it for non-programmers to provide a friendly introduction to the R language. You’ll learn how to load data, assemble and disassemble data objects, navigate R’s environment system, write your own functions, and use all of R’s programming tools. Throughout the book, you’ll use your newfound skills to solve practical data science problems. Read more →

# How I Use R

## by David Keyes // R for the Rest of Us

How I Use R […] Since 2018, I’ve been teaching people to use R through my company, R for the Rest of Us. It’s an incredibly rewarding experience to see people learn to use this powerful piece of software, but it can also be frustrating. One of the hardest parts of learning R (or any language) is taking knowledge from exercises and applying it to an actual project you’re working on. Concepts that make sense in the classroom suddenly become muddled when you’re back at your desk trying to use R to write a report. One of the biggest challenges I’ve had as a teacher is helping people in this … Read more →

# Lightweight Machine Learning Classics with R

## by Marek Gagolewski

Explore some of the most fundamental algorithms which have stood the test of time and provide the basis for innovative solutions in data-driven AI. Learn how to use the R language for implementing various stages of data processing and modelling activities. Appreciate mathematics as the universal language for formalising data-intense problems and communicating their solutions. The book is for you if you’re yet to be fluent with university-level linear algebra, calculus and probability theory or you’ve forgotten all the maths you’ve ever learned, and are seeking a gentle, yet thorough, introduction to the topic. Read more →

# Mastering Spark with R

## by Javier Luraschi, Kevin Kuo, Edgar Ruiz

The Complete Guide to Large-Scale Analysis and Modeling. […] In this book you will learn how to use Apache Spark with R. The book intends to take someone unfamiliar with Spark or R and help you become proficient by teaching you a set of tools, skills and practices applicable to large-scale data science. You can purchase this book from Amazon, O’Reilly Media, your local bookstore, or use it online from this free to use website. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 United States … Read more →

# Math Prefresher for Political Scientists

## by iqss.github.io

Text for Harvard Department of Government Math Prefresher […] The Harvard Gov Prefresher is held each year in August. All relevant information is on our website, including the day-to-day schedule. The 2019 Prefresher instructors are Shannon Parker and Meg Schwenzfeier, and the faculty sponsor is Gary King. This booklet serves as the text for the Prefresher, available as a webpage and as a printable PDF. It is the product of generations of Prefresher instructors. See below for a full list of instructors and contributors. For information about the role of the prefresher (or “math camp”) as a … Read more →

# mixOmics vignette

## by Kim-Anh Le Cao, Sebastien Dejean, Al J Abadi

Vignette for the R package mixOmics […] This document outlines the use of our key functions in our mixOmics package. If you run into any issues reproducing these results, please let us know by creating an issue here. We welcome transparent discussions and suggestions, feel free to share your own on our new mixOmics … Read more →

# Odds & Ends

## by Jonathan Weisberg

An open access textbook for introductory philosophy courses on probability and inductive logic. […] This textbook is for introductory philosophy courses on probability and inductive logic. It is based on a typical such course I teach at the University of Toronto, where we offer “Probability & Inductive Logic” in the second year, alongside the usual deductive logic intro. The book assumes no deductive logic. The early chapters introduce the little that’s used. In fact almost no formal background is presumed, only very simple high school algebra. Several well known predecessors inspired … Read more →

# Physik Libre

## by Michael A. Rundel, et al.

A free physics textbook for upper secondary school (Sekundarstufe II) … Read more →

# R for Data Science

## by Garrett Grolemund, Hadley Wickham

This book will teach you how to do data science with R: You’ll learn how to get your data into R, get it into the most useful structure, transform it, visualise it and model it. In this book, you will find a practicum of skills for data science. Just as a chemist learns how to clean test tubes and stock a lab, you’ll learn how to clean data and draw plots—and many other things besides. These are the skills that allow data science to happen, and here you will find the best practices for doing each of these things with R. You’ll learn how to use the grammar of graphics, literate programming, and reproducible research to save time. You’ll also learn how to manage cognitive resources to facilitate discoveries when wrangling, visualising, and exploring data. Read more →

# R Packages

## by Hadley Wickham, Jennifer Bryan

This book will teach you how to create a package, the fundamental unit of shareable, reusable, and reproducible R code. […] Packages are the fundamental units of reproducible R code. They include reusable R functions, the documentation that describes how to use them, and sample data. In this book you’ll learn how to turn your code into packages that others can easily download and use. Writing a package can seem overwhelming at first. So start with the basics and improve it over time. It doesn’t matter if your first version isn’t perfect as long as the next version is better. This is where … Read more →

# Self-Control in Cyberspace: Applying Dual Systems Theory to a Review of Digital Self-Control Tools

## by Ulrik Lyngs, Kai Lukoff, Petr Slovak, Reuben Binns, Adam Slack, Michael Inzlicht, Max Van Kleek, Nigel Shadbolt

Self-Control in Cyberspace: Applying Dual Systems Theory to a Review of Digital Self-Control Tools […] Note: This is the author’s version of the work. The definitive Version of Record was published in CHI Conference on Human Factors in Computing Systems Proceedings (CHI 2019), May 4–9, 2019, Glasgow, Scotland UK, doi.org/10.1145/3290605.3300361. Smartphones and laptops give their users access to an astonishing range of tasks anywhere, anytime. While this provides innumerable benefits, a growing amount of public discussion and research attention focuses on a perhaps unexpected … Read more →

# Spreadsheet Munging Strategies

## by Duncan Garmonsway

Spreadsheet Munging Strategies […] This is a work-in-progress book about getting data out of spreadsheets, no matter how peculiar. The book is designed primarily for R users who have to extract data from spreadsheets and who are already familiar with the tidyverse. It has a cookbook structure, and can be used as a reference, but readers who begin in the middle might have to work backwards from time to time. The R packages that feature heavily, tidyxl and unpivotr, are much more complicated than readxl, and that’s the point. Tidyxl and unpivotr give you more power and complexity when you need … Read more →

# Statistical Analysis of Agricultural Experiments using R

## by Andrew Kniss & Jens Streibig

Using the R language to analyze agricultural experiments. […] Kniss AR, Streibig JC (2018) Statistical Analysis of Agricultural Experiments using R. https://Rstats4ag.org. Accessed … Read more →

# Statistical Thinking for the 21st Century

Main web site for Statistical Thinking for the 21st Century. Core statistical text, R companion, Python companion (coming soon!). This project is maintained by statsthinking21. … Read more →

# The Tidynomicon

## by Dhavide Aruliah and Greg Wilson

The Tidynomicon […] Years ago, Patrick Burns wrote The R Inferno, a guide to R for those who think they are in hell. Upon first encountering the language after two decades of using Python, I thought Burns was an optimist—after all, hell has rules. I have since realized that R does too, and that they are no more confusing or contradictory than those of other programming languages. They only appear so because R draws on a tradition unfamiliar to those of us raised with derivatives of C. Counting from one, copying data rather than modifying it, lazy evaluation: to quote the other bard, these … Read more →

# Tidy evaluation

## by Lionel Henry, Hadley Wickham

The primary goal of this book is to get you up to speed with tidy evaluation and how to write functions around tidyverse pipelines and grammars. […] The primary goal of this book is to get you up to speed with tidy evaluation by showing you how to write functions using tidyverse pipelines and grammars. The book is written and organised so that you can quickly find the information you need to solve real world problems without having to “get” tidy eval first: The first chapter Getting up to speed is a quick introduction to the main pattern used in all tidy eval functions: quote and unquote. … Read more →

# Tidy tools for supporting fluent workflow in temporal data analysis

## by Earo Wang

This is the website for my PhD thesis at Monash University (Australia), titled “Tidy tools for supporting fluent workflow in temporal data analysis”. … Read more →

# Twitter for R programmers

## by Oscar Baruffa, Veerle van Son

Twitter for R programmers […] Artwork by @allison_horst. The R community is very active on Twitter. You can learn a lot about the language, about new approaches to problems, make friends and even land a job or next contract. It’s a real-time pulse of the R community. This website is free to use, and is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License. So, you’re an R-programmer. What can you gain from becoming active on Twitter? This book will talk about the benefits and it will show you how to use Twitter. First of all, what is Twitter exactly? It’s a … Read more →

# Variability and Consistency in Early Language Learning

## by Michael C. Frank, Mika Braginsky, Daniel Yurovsky, and Virginia A. Marchman

This website is the free online version of a book whose current citation is: Frank, M. C., Braginsky, M., Marchman, V. A., and Yurovsky, D. (in prep). Variability and Consistency in Early Language Learning: The Wordbank Project. Cambridge, MA: MIT Press. The emergence of children’s early language is one of the most miraculous parts of human development. The ability to communicate using language arrives with incredible rapidity – most parents judge that their child is producing words with the intent to communicate before his or her first birthday (Schneider, Yurovsky, and Frank 2015) and the … Read more →

# VCRIS User Guide

## by Virginia Department of Historic Resources

This is documentation for the Virginia Department of Historic Resources’ Virginia Cultural Resources Information System (VCRIS) application. […] VCRIS (Virginia Cultural Resource Information System) provides access to electronic records for historic properties in DHR’s Archives, as well as an online submission system for recording new buildings, structures, landscapes, and archaeological sites. VCRIS includes an interactive web map and detailed information about each site, along with evaluative information about the historic significance of resources. DHR launched VCRIS in 2013 and … Read more →