Below is a list of books written with bookdown, including those published to bookdown.org (books without substantial content are excluded) and a few hosted on external servers. The books are ordered roughly by date. An asterisk * after a date indicates the date is unknown, which often means a date field is missing in the YAML metadata of the source document index.Rmd. The list of books is automatically generated. For more information (including how to add or remove your books on this page), please see the About page.

# R Markdown: The Definitive Guide

2018-08-14

The first official book authored by the core R Markdown developers that provides a comprehensive and accurate reference to the R Markdown ecosystem. With R Markdown, you can easily create reproducible data analysis reports, presentations, dashboards, interactive applications, books, dissertations, websites, and journal articles, while enjoying the simplicity of Markdown and the great power of R and other languages. Read more →

1

# An Introduction to R and LaTeX

2018-08-14

An introduction to R for political scientists. […] This is an introduction to R and Latex. In compiling this documents, several sources have been consulted, including Tim Peterson’s website, Havard’s Math Prefresher, and the course offered by DataCamp. Make sure that you have a laptop throughout this introduction. Install the following applications, if you haven’t done so. Finally, this document is to be used in-class only. As I (will) mention several times, it borrows and merges a lot of resources online. Also, if you see any mistakes or have suggestions, please do shoot me an … Read more →

2

# Math Prefresher for Political Scientists

2018-08-14

Text for Harvard Department of Government Math Prefresher […] The Harvard Gov Prefresher is held each year in August. All relevant information is on our website: https://projects.iq.harvard.edu/prefresher. The 2018 Prefresher instructors are Shiro Kuriwaki and Yon Soo Park, and the faculty sponsor is Gary King. This booklet is the “text” for the Prefresher, and it is the product of generations of Prefresher Instructors: Curt Signorino 1996-1997; Ken Scheve 1997-1998; Eric Dickson 1998-2000; Orit Kedar 1999; James Fowler 2000-2001; Kosuke Imai 2001-2002; Jacob Kline 2002; Dan Epstein 2003; … Read more →

3

# Geocomputation with R

2018-08-14

Forthcoming book on geographic data with R. […] Welcome to the online home of Geocomputation with R, a forthcoming book with CRC Press. Inspired by bookdown and other open source projects we are developing this book in the open. This approach encourages contributions, ensures reproducibility and provides access to the material as it evolves. The book development can be divided into three main phases: New chapters will be added to this website as the project progresses, hosted at geocompr.robinlovelace.net and kept up-to-date thanks to Travis which ensures the reproducibility: The version of … Read more →

4

# Mixed Models in R

2018-08-14

This is an introduction to mixed models in R. It covers a many of the most common techniques employed in such models, and relies heavily on the lme4 package. The basics of random intercepts and slopes models, crossed vs. nested models, etc. are covered. Discussion includes extensions into generalized mixed models and realms beyond. […] … Read more →

5

# Text Mining with R

2018-08-14

A guide to text analysis within the tidy data framework, using the tidytext package and other tidy tools […] This is the website for Text Mining with R! Visit the GitHub repository for this site, find the book at O’Reilly, or buy it on Amazon. This work by Julia Silge and David Robinson is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States … Read more →

6

# Advanced R Solutions

2018-08-13

Solutions to the exercises in Hadley Wickham’s book Advanced R. […] This book aims to contribute solutions to Hadley Wickham’s book Advanced R. We hope to finish the answers to all the exercises till the end of 2018. The code of the book can be found on github. The date of the exercise versions in Hadley’s book (2nd edition) is June 29th 2018. The date of the exercise versions in Hadley’s book (1st edition) is January 25th 2017. This work by Malte Grosser, Henning Bumann, Peter Hurford & Robert Krzyzanowski is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 … Read more →

7

# Population Health Data Science with R

2018-08-13

Population health data science (PHDS) is the art and science of transforming data into actionable knowledge to improve health. R is an open source programming environment for statistical computing and graphics. PHDS is captured by four words: describe, predict, discover, and advise. […] We are writing this book to introduce R—a programming language and environment for statistical computing and graphics—to public health epidemiologists and health care analysts conducting population health analyses. Recent graduates come prepared with a solid foundation in epidemiological and statistical … Read more →

8

2018-08-10

9

# bookdown: Authoring Books and Technical Documents with R Markdown

2018-08-10

A guide to authoring books with R Markdown, including how to generate figures and tables, and insert cross-references, citations, HTML widgets, and Shiny apps in R Markdown. The book can be exported to HTML, PDF, and e-books (e.g. EPUB). The book style is customizable. You can easily write and preview the book in RStudio IDE or other editors, and host the book wherever you want (e.g. bookdown.org). Read more →

10

# Data Processing & Visualization

2018-08-10

The focus of this document is on common data processing and exploration techniques in R, especially as a prelude to visualization. The first part of the document will cover data structures, the dplyr and tidyverse packages, which enhance and facilitate the sorts of operations that typically arise when dealing with data, including faster I/O and grouped operations. For visualization, the focus will be on using ggplot2 and other packages that allow for interactivity. In addition, basic programming concepts and techniques are introduced. Exercises may be found in the document as well. In addition, the demonstrations of the data processing section are available in Python via Jupyter notebooks. Read more →

11

2018-08-09

12

# Introducción a estadística con R

2018-08-09

Este libro introduce conceptos de estadística utilizando R. Está principalmente orientado a estudiantes que deseen aplicar e incrementar sus conocimientos estadísticos usando un lenguaje de programación. Sin embargo, aquellos usuarios que tengan algo de experiencia con R y quieran aventurarse a aumentar sus conocimientos estadísticos pueden encontrar utilidad en los capítulos más avanzados. […] R es quizás el lenguaje más desarrollado para realizar análisis exploratorios de datos y estadística. Debido a que posee una naturaleza dinámica, gratuita, open-source, y una comunidad que trabaja … Read more →

13

# TensorFlow 学习笔记

2018-08-08

TensorFlow 学习笔记 […] 本作品是针对 Tensorflow 深度学习框架的学习笔记，参考的相关资料包括： 本作品使用 R 语言的 Bookdown 扩展包构建，在线版本托管在 https://bookdown.org/leovan/TensorFlow-Learning-Notes ，离线版本请访问托管网站下载。 本作品中使用的部分图标来自 Papirus 图标集。 本作品编译的 PDF 采用 Chapman & Hall 出版社提供的 LaTeX 模板 krantz.cls，英文衬线字体采用 Alegreya，英文无衬线字体采用 Helvetica，中文衬线字体采用 Source Han Serif SC，中文无衬线字体采用 Source Han Sans SC，中文斜体字体采用 Kaiti SC，中英文等宽字体采用 Sarasa Mono SC，数学公式字体采用 Latin Modern Math。 本作品采用 … Read more →

14

# Leading Population Health

2018-08-07

A book example for a Chapman & Hall book. […] In December, 2010, I was appointed the health officer of the City and County of San Francisco, and the director of the Population Health Division1 at the San Francisco Department of Public Health (SFDPH). As health officer I exercised leadership and legal authority to protect and promote health and health equity. For the Population Health Division I directed public health services. Prior to SFDPH I trained in primary care internal medicine, clinical infectious diseases, and epidemiology. Since 1996, I worked as a deputy health officer in various … Read more →

15

# Data Science at the Command Line

2018-08-03

This is the website for Data Science at the Command Line, published by O’Reilly October 2014 First Edition. This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. To get you started—whether you’re on Windows, macOS, or Linux—author Jeroen Janssens has developed a Docker image packed with over 80 command-line tools. Discover why the command line is an agile, scalable, and extensible … Read more →

16

# Manual de R

2018-07-31

En este libro muestra la forma de usar R para realizar analisis estadistico. […] … Read more →

17

# recoding Introduction to Mediation, Moderation, and Conditional Process Analysis

2018-07-30

This project is an effort to connect his Hayes’s conditional process analysis work with the Bayesian paradigm. Herein I refit his models with my favorite R package for Bayesian regression, Bürkner’s brms. I use syntax based on sensibilities from the tidyverse and plot with Wickham’s ggplot2. […] Andrew Hayes’s Introduction to Mediation, Moderation, and Conditional Process Analysis text, the second edition of which just came out, has become a staple in social science graduate education. Both editions of his text have been from a frequentist OLS perspective. This project is an effort to … Read more →

18

# R语言笔记

2018-07-30

Some notes about R and other open souce softwares, such as Pandoc, LaTeX, Inkscape, Ghostscript, Git, Stan, Octave and Python. […] 荃者所以在鱼，得鱼而忘荃；蹄者所以在兔；得兔而忘蹄；言者所以在意，得意而忘言。吾安得夫忘言之人而与之言哉！ The fish trap exists because of the fish; once you’ve gotten the fish, you can forget the trap. The rabbit snare exists because of the rabbit; once you’ve gotten the rabbit, you can forget the snare. Words exist because of meaning; once you’ve gotten the meaning, you can forget the words. Where … Read more →

19

# Bayesian Basics

2018-07-30

This document provides an introduction to Bayesian data analysis. It is conceptual in nature, but uses the probabilistic programming language Stan for demonstration (and its implementation in R via rstan). From elementary examples, guidance is provided for data preparation, efficient modeling, diagnostics, and more. […] … Read more →

20

# Graphical & Latent Variable Modeling

2018-07-28

This document focuses on structural equation modeling. It is conceptually based, and tries to generalize beyond the standard SEM treatment. It includes special emphasis on the lavaan package. Topics include: graphical models, including path analysis, bayesian networks, and network analysis, mediation, moderation, latent variable models, including principal components analysis and ‘factor analysis’, measurement models, structural equation models, mixture models, growth curves, item response theory, Bayesian nonparametric techniques, latent dirichlet allocation, and more. Read more →

21

# An Incomplete Solutions Guide to the NIST/SEMATECH e-Handbook of Statistical Methods

2018-07-22

Analysis of case studies and exercies with a focus on using the tidyverse and ggplot2. This handbook was created using the bookdown package in RStudio. The output format for this example is bookdown::gitbook. […] Exploratory Data Analysis (EDA) is a philosophy on how to work with data, and for many applications, the workflow is better suited for most working scientist and engineers. As a scientist, we are trained to formulate a hypothesis and design a series of experiments that will allow us to test the hypothesis effectively. Unfortunately, most data doesn’t from carefully controlled … Read more →

22

# An Introduction to Text Processing and Analysis with R

2018-07-22

This document covers a wide range of topics, including how to process text generally, and demonstrations of sentiment analysis, parts-of-speech tagging, word embeddings, and topic modeling. Exercises are provided for some topics. … Read more →

23

# An Introduction to Statistical and Data Sciences via R

2018-07-21

An open-source and fully-reproducible electronic textbook bridging the gap between traditional introductory statistics and data science courses. […] Help! I’m new to R and RStudio and I need to learn about them! However, I’m completely new to coding! What do I do? If you’re asking yourself this question, then you’ve come to the right place! Start with our Introduction for Students. This is version 0.4.0 of ModernDive published on July 21, 2018. For previous versions of ModernDive, see Section 1.5. This book assumes no prerequisites: no algebra, no calculus, and no prior programming/coding … Read more →

24

# Generalized Additive Models

2018-07-21

An introduction to generalized additive models (GAMs) is provided, with an emphasis on generalization from familiar linear models. It makes extensive use of the mgcv package in R. Discussion includes common approaches, standard extensions, and relations to other techniques. More technical modeling details are described and demonstrated as well. […] … Read more →

25

# Sosyal Bilimler Araştırmaları İçin R

2018-07-20

Sosyal Bilimler Araştırmaları İçin R […] Bu döküman sosyal bilimler araştırmalarında R programlama dilinin kullanımını anlatmaktadır. İçeriği ile ilgili geri bildirimler için emretoros@gmail.com adresine email atabilirsiniz. Dökümanı kullanmaya başlamadan önce aşağıdaki linkte bulunan 3 videoyu seyretmenizi öneririm. Giriş … Read more →

26

# Field Epidemiology with R

2018-07-18

A book example for a Chapman & Hall book. […] The document format “R Markdown” was first introduced in the knitr package (Xie, 2015, 2018) in early 2012. The idea was to embed code chunks (of R or other languages) in Markdown documents. In fact, knitr supported several authoring languages from the beginning in addition to Markdown, including LaTeX, HTML, AsciiDoc, reStructuredText, and Textile. Looking back over the five years, it seems to be fair to say that Markdown has become the most popular document format, which is what we expected. The simplicity of Markdown clearly stands out among … Read more →

27

# Clustered Data

2018-07-18

This document provides a brief comparison of various approaches to dealing with clustered data situations. … Read more →

28

# A short course on Survival Analysis applied to the Financial Industry

2018-07-17

This is a short course on survival analysis applied to the financial field. […] This book is designed to provide a guide for a short course on survival analysis. It is mainly focussed on applying the stastical tecnquines developed in the survival field to the financial industry. The emphasis is placed in understanding the methods, building intuition about when aplying each of them and showing their application through the use of statistical … Read more →

29

# Advanced Statistical Computing

2018-07-17

The book covers material taught in the Johns Hopkins Biostatistics Advanced Statistical Computing course. I taught this course off and on from 2003–2016 to upper level PhD students in Biostatistics. The course ran for 8 weeks each year, which is a fairly compressed schedule for material of this nature. Because of the short time frame, I felt the need to present material in a manner that assumed that students would often be using others’ software to implement these algorithms but that they would need to know what was going on underneath. In particular, should something go wrong with one of … Read more →

30

# Understanding Work With Data in Summer STEM Programs Through An Experience Sampling Method Approach

2018-07-16

This is Joshua Rosenberg’s dissertation […] Data-rich activities provide an opportunity to develop core competencies in both science and mathematics identified in curricular standards. Perhaps even more importantly work with data puts learners in the position to use data to ask and answer questions, a potentially empowering capability. Research on work with data has focused on cognitive outcomes and the development of specific practices at the student and classroom levels, and yet, little research has considered learners’ engagement. The present study explores learners engagement in work … Read more →

31

2018-07-16

32

# Learning statistics with R: A tutorial for psychology students and other beginners. (Version 0.6.1)

2018-07-15

Learning Statistics with R covers the contents of an introductory statistics class, as typically taught to undergraduate psychology students, focusing on the use of the R statistical software. […] Learning Statistics with R covers the contents of an introductory statistics class, as typically taught to undergraduate psychology students, focusing on the use of the R statistical software. The book discusses how to get started in R as well as giving an introduction to data manipulation and writing scripts. From a statistical perspective, the book discusses descriptive statistics and … Read more →

33

# 基于R语言的科研信息分析与服务

2018-07-15

Scientific Research information service using R […] 在图书馆开设R语言系列讲座也有一年半载了，在此过程中我萌生了用R语言写一本书的想法，一方面是想为学生提供R语言学习范例，另一方面也借此为我校科研人员提供一些科研信息服务。如果此举能做到教学相长，更好地实践和应用数据科学，也算是一次很有意义的尝试，无奈自己时间精力有限，写书进展缓慢。 这本书是这样的， 第 1 章简单介绍数据科学与R语言， 第 2 章引入科研信息数据集，并利用tidyverse宏包进行数理统计和数据可视化， 第 3 章统计科研论文中通讯地址使用情况，并给出写作的规范建议， 第 4 章介绍了各学院对ESI学科的贡献，以及期刊对引文的贡献， 第 5 章基于中科院JCR期刊分区分析我校科研人员的选刊倾向， 第 6 … Read more →

34

# Introducción a la Computación con GPUs usando R

2018-07-14

Revisión de conceptos clave sobre la computación GPGPU, y algunos ejemplos simples de uso de librerías aceleradas por GPU […] Las GPU (Graphics Processing Units; Unidades de Procesamiento de Gráficos) son unidades de procesamiento diseñadas originalmente para procesar gráficos en una computadora rápidamente. Esto se hace teniendo una gran cantidad de unidades de procesamiento simples para cálculos masivamente paralelos. La idea de la computación de propósito general en GPU (GPGPU: general purpose GPU computing) es explotar esta capacidad para el cálculo general. En este tutorial se revisará … Read more →

35

# HPC con R para Investigadores

2018-07-13

HPC con R para Investigadores […] “Programmers waste enormous amounts of time thinking about, or worrying about, the speed of noncritical parts of their programs, and these attempts at efficiency actually have a strong negative impact when debugging and maintenance are considered.” — Donald Knuth. Optimizar código para hacerlo más rápido es un proceso … Read more →

36

2018-07-08

37

2018-07-07

38

# Meta-Workflow

2018-07-04

This is a workflow for metabolomics studies. […] This is an online handout for data analysis in mass spectrometry based metabolomics. It would cover a full reproducible metabolomics workflow for data analysis and important topics related to metabolomics. Here is a list: This is a book written in Bookdown. You could contribute it by a pull request in Github. R and Rstudio are the softwares needed in this … Read more →

39

2018-06-21

40

# Macroeconomics

2018-06-19

This is a collection of the discussion lists from Macroeconomics. […] The theory contents will follow 1 closely. Item 2 is for data visualization. And item 3 is for general discussion regarding world news. https://goo.gl/kbQwP5 Class participation and quizzes: 10% Midterm Exam: 30% Final Exam: 30% Others Rhttp://www.r-project.org/ RStudiohttp://rstudio.org/ Github desktophttps://desktop.github.com/ … Read more →

41

# Data Visualization Project

2018-06-17

Data Visualization Project […] This study aims at investigating how the change of information dissemination process would affect the window-dressing behaviors of mutual fund managers. By convention, window-dressing is defined as the portfolio manipulations right before the quarter-end date, when all the fund managers are required to disclosure their holding firms of that date. Over the past decades, technological progresses largely change the way how information disseminates, and these further influence the information flow of capital markets. For example, the implementation of “Electronic … Read more →

42

# «Two Lives» by Concordia Antarova

2018-06-17

«Two Lives» by Concordia Antarova: text translation and analysis […] This work presents the working draft of the English translation of the “The Lives” book by Concordia Antarova. Widely known in Russian speaking spheres, and translated into French, this book remains to be largely unknown to English speaking population, despite its significant spiritual importance, comparable to that of “Book of Joy”. While the efforts on translating this book into English continue, here the draft of it is used for Artificial Intelligence (AI) projects, aiming at building the systems for automated analysis … Read more →

43

# «Кубатура Шара»

2018-06-17

Poetry of Andrey Gorodnichy […] Версия для печати: PDF, EPUB. Online: https://bookdown.org/gorodnichy/andre. … Read more →

44

# Marketing department KULeuven: R tutorial

2018-06-15

KULeuven R tutorial for marketing students […] In this tutorial, we will explore R as a tool to analyse and visualise data. R is a statistical programming language that has rapidly gained popularity in many scientific fields. The main difference between R and other statistical software like SPSS is that R has no graphical user interface. There are no buttons to click. R is run entirely by typing commands into a text interface. This may seem daunting, but hopefully by the end of this tutorial you will see how R can help you to do better statistical analysis. So why are we using R and not one … Read more →

45

# Foundations of Statistics with R

2018-06-10

This book is written for the purposes of teaching STAT 3850 at Saint Louis University. […] This is a book on probability and statistics suitable for the sophomore or junior level at university. We assume knowledge of calculus at the level of Calculus II. We do not assume prior experience with statistics or programming, though students who have no experience with either statistics or programming before starting this class should expect to have to work hard. We will be using R as an integral part of the exposition — you should not read this book without first getting R Studio installed. We … Read more →

46

# Introduction to R Markdown

2018-06-09

This document will introduce participants to the basics of R Markdown. After an introduction to concepts related to reproducible programming and research, demonstrations of standard markdown, as well as overviews of different formats, will be provided, including exercises. […] … Read more →

47

# blogdown: Creating Websites with R Markdown

2018-06-05

A guide to creating websites with R Markdown and the R package blogdown. […] In the summer of 2012, I did my internship at AT&T Labs Research,1 where I attended a talk given by Carlos Scheidegger (https://cscheid.net), and Carlos said something along the lines of “if you don’t have a website nowadays, you don’t exist.” Later I paraphrased it as: “I web, therefore I am a spiderman.” Carlos’s words resonated very well with me, although they were a little exaggerated. A well-designed and maintained website can be extremely helpful for other people to know you, and you do not need to wait for … Read more →

48

# Thucydides the Neorealist?

2018-06-02

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] Thucydides has long been viewed as an early exemplar of realist thinking in International Relations Theory. More recently, neorealist authors have claimed that Thucydides’ History offers insights into the importance of the anarchy in shaping interstate relations, and should be recognised as a neorealist. This neorealist appropriation has met substantial criticism and many revisionist scholars have urged a re-examining of Thucydides. This dissertation serves … Read more →

49

2018-06-01

50

2018-06-01

51

# (Applied) Causal Analysis

2018-05-30

Script for the seminar Causal Analysis at the University of Mannheim. […] The present document serves both as slides and script for the workshop/seminar Causal Analysis. This seminar is taught by Paul C. Bauer (right now - Spring Semester 2018 - at the University of Mannheim). The material was developed by Paul C. Bauer and is based on earlier material developed for workshops at the European University Institute and at the Programme doctoral en science politique (PDSPO), Switzerland. It is licensed under a Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0). Conditions for … Read more →

52

# Feature Engineering and Selection: A Practical Approach for Predictive Models

2018-05-24

Feature Engineering and Selection: A Practical Approach for Predictive Models […] A note to readers: this text is a work in progress. It will eventually be published in this format as well as a more traditional physical medium by Chapman & Hall/CRC. Chapters 1 through 6 have been completed (pending more comments) and the rest are being constructed as you read this. We’ve released this initial version to get more feedback beyond what our excellent reviewers and editor have already provided. Feedback can be given at the GitHub repo https://github.com/topepo/FES/issues. Copyediting has not … Read more →

53

2018-05-17

54

# «Shri Jobim»

2018-05-16

Antonio Carlos Jobim songs re-interpreted by Dmitry Gorodnichy […] Альбом состоит из тематически разделенных двух частей. Первая часть («Grand amor») состоит из песен о любви, вторая («Ocean») из песен о смысле и красоте жизни. Помимо слов и аккордов, прилагаются записи песен: оригиналы из YouTube и их новые интерпретации в формате mp3. Записи, помеченные “+”, содержат дорожку, наложенную на правый канал первоначальной записи. Регулируя баланс, можно достичь желаемой громкости добавленной дорожки. Переход от песни к песни можно осуществить либо через открывающееся слева меню, либо … Read more →

55

# Hello Py: Python 程式設計

2018-05-14

Pyradise 是專注於 Python 教學的團隊，致力於分享學習經驗，推廣資料科學，人工智慧，讓更多人能參與到這波資訊與人工智慧的學習浪潮。 專注於技術，熱衷於教學的開發者，希望透過教學，傳遞出更多想法的帽子哥。 資料科學與推廣教育的愛好者，閒暇時喜歡長跑與乒乓球；是 2017 iT邦幫忙鐵人賽 Big Data 組冠軍。 前端工程師與設計師。 … Read more →

56

# Introduction to Digital Currency

2018-05-12

A summary of research conducted hitherto. […] This is research I have conducted for personal use. Using the bookdown package has enabled me to piece together my research in a quick and neat manner. I have tried to convey complex terms as simply as possible utilizing visual examples where I can. Constructive criticism is welcomed - I will regularly be updating this … Read more →

57

2018-05-12

58

# R for Social Scientists

2018-05-10

Script for a an R course at the European University Institute. … Read more →

59

# Lecture Notes voor Beleidsinformatica

2018-05-07

Dit zijn de lecture notes van het opleidingsonderdeel Beleidsinformatica […] Dit document bevatten de lecture notes voor het opleidingsonderdeel Beleidsinformatica (3512), gedoceerd aan de Universiteit Hasselt. Ieder hoofdstuk dient ter ondersteuning van een van de hoorcolleges en bevat zowel een samenvatting in “bullet-point” stijl alsook een verzameling bronnen op basis waarvan het hoorcollege is opgebouwd. We raden aan om deze lecture notes steeds kort na het hoorcollege door te nemen en aan te vullen met je eigen notities uit het college. Ook raden we aan de bronnen te raadplegen voor … Read more →

60

# Machine Learning

2018-05-06

This document provides an introduction to machine learning for applied researchers. While conceptual in nature, demonstrations are provided for several common machine learning approaches of a supervised nature. In addition, all the R examples, which utilize the caret package, are also provided in Python via scikit-learn. […] Michael Clark https://m-clark.github.io … Read more →

61

# Steem Handbook

2018-05-03

Steem Handbook […] 本书的编写和维护需要长久的贡献，大门永远敞开，欢迎加入我们。你可以： 贡献本书缺失的内容； 修改已有内容； 修改错别字； 其他任何跟书稿编写有关的工作。 向本书项目投稿的方法见附录16。 主编：@dapeng 副主编： @maiyude 顾问（按字母顺序）：@deanliu @jademont @lemooljiang @oflyhigh @rivalhw @sweetsssj @tumutanzi 编剧： @maiyude 封面设计： @maiyude 本书的各章作者、编辑、校对见各章节脚注。待书稿完成后，名单将汇总在这里。 … Read more →

62

# R_Dplyr_minicourse

2018-05-03

E.Major R_Dplyr_minicourse […] Dplyr是R語言當中相當重要的資料處理套件，同時也是跨出探索式資料分析的第一步。 探索式資料分析是透過視覺化或敘述統計的方式，去觀察資料本身的特性或者變數與變數之間的關聯，以求對資料有更多的認識，看看是否有意外有趣的發現或者不符合常理的地方。當然也包含資料清理與建立必要變數的部分，必要時需要透過爬蟲或者引入第三方資料，才算完整。 資料處理做得好，整體的分析方向和後面的統計建模才會有意義且往對的道路前進，以避免不必要的時間、資源浪費。 本課程將逐步介紹Dplyr常用的分析資料函數，會搭配dplyr cheatsheet以及Help做講解，希望能夠在熟悉基本工具之後，未來甚至不需要這本電子書就能 … Read more →

63

# Meu log de leitura de R for Data Science

2018-05-01

Meu log de leitura de R for Data Science […] Se tem alguma pessoa que pode ser considerada um “pop star” do R, seria o Hadley Wickham: o cara é responsável pelo ggplot2 e pelo dplyr, que são alguns dos pacotes mais populares do R! Mas são justamente pacotes que eu quase não uso… :( Deixe eu explicar melhor. Eu sou usuário do R há muitos anos (fiz as contas de cabeça enquanto eu escrevo, e se não me enganei, agora em 2018 seriam uns 13 ou 14 anos!), então já tem um bocado de tempo que aprendi a como resolver (e ensinar) algumas coisas. Até aí tudo bem. Acontece que o Hadley trouxe uma … Read more →

64

# Technical Foundations of Informatics

2018-04-26

The course reader for INFO 201: Technical Foundations of Informatics. […] This book covers the foundation skills necessary to start writing computer programs to work with data using modern and reproducible techniques. It requires no technical background. These materials were developed for the INFO 201: Technical Foundations of Informatics course taught at the University of Washington Information School; however they have been structured to be an online resource for anyone hoping to learn to work with information using programmatic approaches. This book is licensed under a Creative Commons … Read more →

65

# Lecture Notes voor Business Process Management (3637)

2018-04-25

Dit zijn de lecture notes van het opleidingsonderdeel Business Process Management […] Dit document bevatten de lecture notes voor het opleidingsonderdeel Business Process Management (3637), gedoceerd aan de Universiteit Hasselt. Deze lecture notes dienen ter ondersteuning van de colleges en bevatten zowel een “bullet-point” samenvatting van de voornaamste topics alsook een verzameling van bronnen voor verdere verdieping in de … Read more →

66

# ggplot2 介紹

2018-04-21

ggplot2 介紹 […] hypothes.is: https://hypothes.is/groups/eBBqEGde/minicourse-ggplot2 要在hypothes.is貼上程式碼時，請依下例張貼： ggplot2 cheatsheet Computing for the Social Sciences, U.Chicago. ggplot2part of the … Read more →

67

# Brief introduction to Statistic

2018-04-20

Brief introduction to Statistic […] Many statistical quantities derived from data samples are found to follow the Chi-squared distribution. Hence we can use it to test whether a population fits a particular theoretical probability distribution. In this section, we consider a multinomial experiment with k outcomes that correspond to categories of a single qualitative variable. The results of such an experiment are summarized in a one-way table. The term one-way is used because only one variable is classified. Typically, we want to make inferences about the true proportions that occur in the … Read more →

68

# Lösningar i R till vissa uppgifter från övningskompendierna (samt lite annat kul)

2018-04-19

Lösningar för vissa uppgifter i kursen Statstik A4/A8 […] Detta dokument är till för dig som läser kursen Statistik A4/A8 och är nyfiken på R. Innehållet är tänkt att förena lite nytta (lösa uppgifter) med nöje (lära dig lite R). Det är inte meningen att detta dokument skall fungera som en heltäckande introduktion till programmeringsspråket R. Det finns mängder av väldigt välskrivna guider online som fokuserar mycket mer på hur språket är uppbygt. Lyckligtvis är R väldigt enkelt att komma igång med, och det krävs inte mycket förståelse för själva språket för att göra enkla beräkningar, … Read more →

69

# Handling Strings with R

2018-04-19

This book aims to provide a panoramic perspective of the wide array of string manipulations that you can perform with R. If you are new to R, or lack experience working with character data, this book will help you get started with the basics of handling strings. Likewise, if you are already familiar with R, you will find material that shows you how to do more advanced string and text processing operations. Read more →

70

# Lecture Notes voor Exploratieve en Descriptieve Data Analyse

2018-04-16

Dit zijn de lecture notes van het opleidingsonderdeel Exploratieve en Descriptieve Data Analyse […] Dit boek bevat de lecture notes voor de cursus “Exploratieve en Descriptieve Data Analyse” (1ste Ba Handelsingenieur/Handelsingenieur in de Beleidsinformatica) aan de Universiteit Hasselt. Het idee van dit document is een begeleidende tekst aan te reiken ter ondersteuning van de slide-decks die gebruikt worden tijdens de hoorcolleges. Deze tekst is “bullet-point” gewijs opgebouwd en helpt het verhaal dat tijdens het hoorcollege wordt verteld terug op te roepen. Daarnaast zal er per hoofdstuk … Read more →

71

# New statistics for the design researcher

2018-04-07

A statistics book for designers, human factors specialists, UX researchers, applied psychologists and everyone else who works hard to make this world a better place. […] This book makes the following assumptions: Chapter @ref(design_research) introduces a framework for quantitative design research. It carves out the basic elements of empirical design research, such as users, designs and performance and links them to typical research problems. Then the idea of design as decision making under uncertainty is developed at the example of two case studies. Chapter @ref(bayesian_statistics) … Read more →

72

2018-04-05

73

# Noções de Inferência no R

2018-03-28

Esta apostila é uma ferramenta de apoio às aulas teóricas de ME319-Noções de Inferência. […] O objetivo desta apostila é apresentar os conceitos de inferência ministrados em sala de aula na disciplina ME319 - Noções de Inferência de uma forma prática e intuitiva utilizando recursos computacionais como o software R. ME319 - Noções de Inferência - IMECC/UNICAMP Após fazer a instalação do R vamos instalar o RStudio. O RStudio é uma nova interface para o R com diversas propriedade que facilita o uso do … Read more →

74

# Novel methods for dose–response meta-analysis

2018-03-06

Novel methods for dose–response meta-analysis […] A single experiment can hardly provide a definitive answer to a scientific question. Science is oftentimes referred to as a cumulative process where results from many studies, aiming to address a common question of interest, contribute to create and update the scientific evidence. In the cumulative paradigm, meta-analysis is the statistical methodology to combine and compare the current evidence in the field. This process lies at the heart of the concept of evidence-based medicine and plays a major role in policy and decision making. … Read more →

75

# The Queens College Guide to Life

2018-03-02

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a collaboratively written guide for courses at Queens College that focus on the biology of tiny things, which … Read more →

76

# The Queens College Stats Guide

2018-03-01

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a guide to learning statistics at Queens College. Currently we support the following courses directly with … Read more →

77

# An R Platform for Social Scientists

2018-02-27

R book for social scientists […] The online version of this platform is licensed under the CC0 by Burak AYDIN. We aim to create a platform for the applied social scientists in which we can demonstrate basic statistical procedures using R (R Core Team 2016b) and real data. We prefer to name this material as a platform given that (a) it is open for contribution, (b) it will have dynamic content and © it can serve as a mainboard for Plug-ins and Add-ons . This R material is created with Bookdown (Xie 2016), an advanced system constructed on R Markdown (Allaire et al. 2016) and the R … Read more →

78

# Sosyal Bilimler R Platformu

2018-02-27

Sosyal Bilimler R Platformu […] Bu platformun hakları korunmuştur CC0 by Burak AYDIN. Bu materyal İngilizce olarak hazırlanıp Türkçeye çevirilmiştir. Bu platform sosyal bilimler alanında çalışan ve nicel veri analizlerinin teoriden ziyade uygulama aşamasına ilgi gösteren araştırmacılar için oluşturulmuştur. Bütün istatistiksel prosedürler R (R Core Team 2016b) ile yürütülmüş, gerçek veri kullanımına özen gösterilmiştir. Bu materyale platform denilmesinin üç sebebi vardır, (a) katkıya açıktır,(b) dinamik bir içeriğe sahiptir, © bilgisayar anakartı gibi kullanılabilir, R ile oluşturulmuş … Read more →

79

# The Queens College Guide to Ecosystems

2018-02-26

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a collaboratively written guide for courses at queens college that focus on the biology of big things, like communities and cats, which … Read more →

80

# The Queens College Alchemy Guide

2018-02-26

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a collaboratively written guide for chemistry courses at Queens College, … Read more →

81

# The Queens College Political Guide

2018-02-26

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a guide to learning political science courses at queens college, as well as a resource for learning about political science related fields like labor studies, community organizing, and urban studies. Currently the guide supports the following … Read more →

82

# Numerical Analysis: Notes

2018-02-23

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a collection of my notes and algorithms from a course on Numerical Analysis at the University of Iceland. The book used in the course was Numerical Analysis by Timothy … Read more →

83

# Notes for Predictive Modeling

2018-02-20

Notes for Predictive Modeling […] The reason is because they are hosted at https websites with auto-signed SSL … Read more →

84

# A short course on nonparametric curve estimation

2018-02-20

A short course on nonparametric curve estimation […] This course is intended to provide an introduction to nonparametric estimation of the density and regression functions from, mostly, the perspective of kernel smoothing. The emphasis is placed in building intuition behind the methods, gaining insights into their asymptotic properties, and showing their application through the use of statistical software. The reason is because they are hosted at https websites with auto-signed SSL … Read more →

85

# The Final War

2018-02-19

My book about adventures in a video game named Minecraft. […] This book is about my adventures in a video game named … Read more →

86

# Comparative Methods

2018-02-15

How to do comparative methods for evolution and ecology […] This book was created as part of my PhyloMeth class, which focuses on sensibly using and developing comparative methods. It will be actively developed over the course of Spring 2017, so if you don’t like this version (see date above), check back soon! The book is available here but you can fork it, add issues, and look at raw source code at https://github.com/bomeara/ComparativeMethodsInR. [Note I’ll be changing the name of the repo eventually; the course is largely in R (not entirely) but of course many key methods appear in other … Read more →

87

# APS 135: Introduction to Exploratory Data Analysis with R

2018-02-12

Course book for Introduction to Exploratory Data Analysis with R (APS 135) in the Department of Animal and Plant Sciences, University of Sheffield. […] This is the online course book for the Introduction to Exploratory Data Analysis with R component of (APS 135) module. You can view this book in any modern desktop browser, as well as on your phone or tablet device. The site is self-contained—it contains all the material you are expected to learn this year. Bethan Hindle is running the course this year. Please email her if you spot any problems. You will be introduced to the R ecosystem. R … Read more →

88

# Broadening Your Statistical Horizons

2018-02-10

Test. […] Broadening Your Statistical Horizons (BYSH): Generalized Linear Models and Multilevel Models is intended to be accessible to undergraduate students who have successfully completed a regression course through, for example, a textbook like Stat2. We started teaching this course at St. Olaf in 2003 so students would be able to deal with the non-normal, correlated world we live in. It has been offered at St. Olaf every year since; in fact, it is required for all statistics concentrators. Even though there is no mathematical prerequisite, we still introduce fairly sophisticated topics … Read more →

89

# Math 390.4: Data Science with R

2018-02-03

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a book published using the R Markdown language. R Markdown supports Latex, so you can make pretty equations like Professor Kapelner likes: (a^2 + b^2 = c^2). To type inline latex, just surround your code with dollar signs. That was published like this: $a^2 + b^2 = c^2$ You can edit the markdown for this book from RStudio just like you would edit a regular R Markdown (.Rmd) file. Here’s a picture of what it looks like as I edit this book and the R … Read more →

90

# The Queens College Guide to the Foundations of Life

2018-01-31

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown which uses Yihui Xie’s Bookdown Package for R. Every section (except for these first 2 sentences and section 3, the biochemistry guide) is from their sample book. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). The bookdown package can be installed from CRAN or Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level … Read more →

91

# YaRrr! The Pirate’s Guide to R

2018-01-22

An introductory book to R written by, and for, R pirates […] The purpose of this book is to help you learn R from the … Read more →

92

# Lab notes for Statistics for Social Sciences II: Multivariate Techniques

2018-01-20

Lab notes for Statistics for Social Sciences II: Multivariate Techniques […] Welcome to the lab notes for Statistics for Social Sciences II: Multivariate Techniques. Along these notes we will see how to effectively implement the statistical methods presented in the lectures. The exposition we will follow is based on learning by analyzing datasets and real-case studies, always with the help of statistical software. While doing so, we will illustrate the key insights of some multivariate techniques and the adequate use of advanced statistical software. Be advised that these notes are neither … Read more →

93

# Programming for Psychologists

2018-01-13

a book to accompany the course Programming for Psychologists […] This book is authored using [bookdown][https://bookdown.org/yihui/bookdown/]. … Read more →

94

# Estimacdiión en dominios pequeños

2018-01-12

Estedd lidbro plantea una introducción a la estimación de áreas pequeñas con el software R. […] Este libro plantea una introducción a la estimación de áreas pequeñas con el software R. xxxx vv zz second commit in Github This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). For now, you have to install the development versions of bookdown from Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need to … Read more →

95

# Field Guide to the R Ecosystem

2018-01-12

This guide aims to introduce the reader to the main elements of the R ecosystem. […] This work is licensed under a Creative Commons Attribution 4.0 International … Read more →

96

2018-01-10

97

# Financial Engineering Analytics: A Practice Manual Using R

2018-01-09

This book explores the fundamentals of financial analytics using R and various topics from finance. […] Science alone of all the subjects contains within itself the lesson of the danger of belief in the infallibility of the greatest teachers of the preceding generation. - Richard Feynman This book is designed to provide students, analysts, and practitioners (the collective “we” and “us”) with approaches to analyze various types of financial data sets, and to make meaningful decisions based on statistics obtained from the data. The book covers various areas in the financial industry, from … Read more →

98

# R och Demoskop

2018-01-09

Det här är ett dokument för att komma igång med R på Demoskop […] Det här är ett dokument om R på Demoskop. R är ett programmeringsspråk för statistisk analys. På Demoskop används R i huvudsak som ett komplement till den programmering som vi gör SAS och SPSS. Det här dokumentet är anpassat efter våra arbetssätt på Demoskop. Några generella förkunskaper behövs inte. Däremot så rekommenderar vi att du efter att du gjort installationen gör den här kursen på datacamp.com. Det är enkel introduktion till R och några paket som underlättar arbetsflödet. Datacamp är en bra hemsida för att lära sig … Read more →

99

# Github 介紹

2017-12-30

Github 介紹 […] 這裡我們用非程式設計者懂的說法來解釋，故不符合它們原始的完整定義。 Github.com: 一個【雲端空間】讓你儲存備份用 Github Desktop: 安裝在你電腦上的【備份小精靈】，透過他，你可以選擇將某個資料匣裡的東西備份在自己電腦，或進一步備份在Github.com雲端空間。 我們先假設你已經在Github.com（以下簡稱.com）註冊了一個帳號，也在你電腦安裝了Github Desktop（以下簡稱Desktop），並把Desktop設定好可以和你的.com帳號連結。 … Read more →

100

# Functional programming and unit testing for data munging with R

2017-12-28

This book is an introduction to functional programming and unit testing with the R programming language, for the purpose of data muning […] This book is still being written, some chapters are not finished yet, and there might be (there are) some typos. Don’t hesitate to write to me if you notice something weird. You can purchase a digital copy of this book at leanpub. The version on Leanpub will not always be up-to-date, I only update it when I made very big changes (new chapters, etc). But once this book will be finished, both version are going to be the same. This book serves to show how … Read more →

101

# R Markdown 介紹

2017-12-27

dplyr 介紹 […] 一個標準化的純文字語法（syntax），用來表達豐富的排版意境。 Wiki範例 本身不會產生word, html或pdf檔，而是透過其他應用程式，如pandoc，來進一步生成相關文件格式。 … Read more →

102

# dplyr 介紹

2017-12-22

dplyr 介紹 […] … Read more →

103

# IRT (GMMSGE01): Parametric IRT (dichotomous data)

2017-12-13

IRT (GMMSGE01): Parametric IRT (dichotomous data) […] Parametric item response theory (IRT) provides a theoretical framework that allows modeling the relationship [\text{item} \longleftrightarrow \text{person}] by means of a mathematical function: [P(X_i = c|\theta_n) = f(\theta_n)] (X_i) is the random variable denoting the answer to item (i), with discrete response categories; (\theta_n=) (n^\text{th}) person’s trait parameter. This is the item response function (IRF). The IRF is therefore a function relating the latent trait to the probability of answering the item correctly. … Read more →

104

# Economic Forum

2017-12-13

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. … Read more →

105

# Shiny (I)

2017-12-13

Shiny (I) … Read more →

106

# Geostatystyka w R

2017-11-26

Introduction to geostatistics with R (in Polish). […] Masz przed sobą skrypt zawierający materiały do ćwiczeń z geostatystyki. Składa się ona z kilkunastu rozdziałów pokazujących jak: dodawać i wizualizować dane przestrzenne w R (rozdział 2), wykonywać wstępną eksplorację danych nieprzestrzennych (rozdział 3), wstępnie analizować dane przestrzenne (rozdział 4), wykorzystywać deterministyczne metody interpolacji (rozdział 5), rozumieć i tworzyć przestrzenne miary podobieństwa i niepodobieństwa (rozdział 6), modelować semiwariogramy bezkierunkowe i kierunkowe (rozdział 7), tworzyć estymacje … Read more →

107

# Simulation And The James-Stein Estimator In R

2017-11-07

Simple Simulation and the James-Stein Estimator […] This is the website for “Simulation And The James-Stein Estimator In R”. This technical document is short, covering some common ways to generate data and exploring the James-Stein Estimator. This will teach you how to do run simulations to observe the properties of the James-Stein Estimator in R — specifically using the tidyverse: You’ll learn how to generate data to prove theoretical results. In the computer age of statistics the data scientist has the power of machines to run simulations for testing a methods before putting a method into … Read more →

108

# Data visualization

2017-11-06

This is a collection of data visualization handouts from Macroeconomics. … Read more →

109

# Muestreo y análisis de estudios educacionales con R

2017-11-05

Este es el repositorio del libro Diseño y análisis de estudios educacionales. […] Las fórmulas computacionales requeridas para estimar la varianza de estadísticas descriptivas como la media muestral están disponibles para algunos diseños complejos que incorporan elementos como la estratificación y el muestreo por conglomerados. Sin embargo, en el caso de estadísticas analíticas más complejas, tales como coeficientes de correlación y coeficientes de regresión, no se encuentra fácilmente las fórmulas específicas en diseños muestrales que se aparten del muestreo aleatorio simple. … Read more →

110

# Selected Solutions to R4DS Exercises

2017-11-03

This book provides selected solutions to the exercises in the wonderful book R for Data Science by Wickham Hadley. […] This is the website for “Selected Solutions to R4DS Exercises”. This is a joint advanture between Chunji Wang, Ron, Luna, Zhiyin, Chengcheng…. We started the “R4DS Study Club” on Sep 22nd, 2017; If you want to join us, please contact us! The chapter labels in this book is the same as the original R4DS book; go to the corresponding chapter for solutions. You might need to read the beginning of the chapter to load some packages or create some variables that are … Read more →

111

# R bookdownplus Textbook

2017-11-01

A tutorial to R bookdownplus, an extension of R bookdown package. This books shows helps you write academic journal articles, guitar books, chemical equations, mails, calendars, and diaries, on the basis of R bookdown. […] A book titled R bookdownplus Textbook is surely talking about ‘bookdownplus’ (Zhao 2017b), but let’s start with ‘bookdown’ (Xie 2016). ‘bookdown’ is a software package for writing books or documents based on R language (R Core Team 2016) and Markdown syntax. It is something like Microsoft Word, but more elegant, more powerful, and … Read more →

112

# Guide til klinikophold

2017-11-01

Denne side tjener som vejledning og inspiration til supervisorer og studerende på den præ-graduate, kliniske uddannelse på klinisk biomekanik. […] På disse sider finder du vejledning til de præ-graduate kliniskeophold for kiropraktor-studerende (stud. kand.manu) De præ-graduate klinikophold er opdelt som illustreret herover; med et præ-klinisk kursus på SDU efterfulgt af en ‘clinic-entrance’ eksamen, et længere ophold på rygcenter og 2 mindre ophold i andre regi. Teksten er opdelt i to hovedsektioner – én som primært er skrevet med supervisorerne for øje og én for studerende. Begge … Read more →

113

# Applications of Multivariate Analysis in Business

2017-10-29

This document describes the concept of Mass Customisation as it applies to Business Analytics and provides case study implementations of R Studio […] It has been great being part of the Analytical Community the last few years. The excitement is everywhere about “big-data”,“data-science”,“MOOCs”. The talent being attracted into Analytics is awe inspiring.One current trend is ‘a shift from a desire to work for bigger name brand companies like Facebook or Google, to more mission-driven organizations attempting to make an impact on society. Whether it is curing cancer, conserving energy, … Read more →

114

# What does the plant do?

2017-10-24

A Planter’s Punch that quickly got out of hand […] Plants collect energy from sunlight and use it to produce fruits that we eat, fibers that we wear and much, much more; in a process called photosynthesis. This process is fundamental for our life on Earth, and it has been intensively studied for centuries by scientists. We, scientists at CEPLAS, are also contributing. Here I’ll give you a glimpse of our scientific research. After a very short introduction to photosynthesis, I’ll explain to you one of its details and one of the methods that we are using to investigate it. The method is … Read more →

115

# ABJ Syllabus

2017-10-23

A track of papers we read and papers we collect to read in future. […] Para que o seu bookdown funcione tanto na web quanto no pdf, você deve evitar usar marcadores que dependem do contexto. Para fazer citações você deve usar (Weinstein 1997) ou Weinstein (1997). Isso também funciona pra pacotes (R Core Team 2017) ou R Core Team (2017). Para criar uma figura, é preferível que você use o print padrão do knitr. A label do gráfico será fig:label-do-chunk. Você pode citar fazendo 1.1. Se você precisar importar uma imagem de fora do R, é melhor que você faça ![](), a despeito do que diz o Yihui. … Read more →

116

# Studieren und Forschen mit dem Internet

2017-10-16

Arbeitsprozesse und Werkzeuge des wissenschaftlichen Arbeitens. Gekürzte Ausgabe aus 2001, aber viele Inhalte noch aktuell. […] Studieren und Forschen mit dem Internet von Peter Baumgartner & Sabine Payr ist lizenziert unter einer Creative Commons Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International Lizenz.Über diese Lizenz hinausgehende Erlaubnisse können Sie unter http://peter.baumgartner.name/kontakt erhalten. Studieren und Forschen mit dem Internet ist 2001 beim StudienVerlag herausgekommen und heute vergriffen. Restexemplare können nach wie vor gebraucht über Amazon … Read more →

117

# Mastering Software Development in R

2017-09-21

The book covers R software development for building data science tools. As the field of data science evolves, it has become clear that software development skills are essential for producing useful data science results and products. You will obtain rigorous training in the R language, including the skills for handling complex data, building R packages and developing custom data visualizations. You will learn modern software development practices to build tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers. Read more →

118

# IsoriX: Isoscape Computation and Inference of Spatial Origins using R

2017-09-17

This book is the official documentation for the R package IsoriX. […] … Read more →

119

# Course Notes for IS 6489, Statistics and Predictive Analytics

2017-09-03

Course notes for IS 6489. […] These are the course notes for IS 6489, Statistics and Predictive Analytics, offered through the Information Systems (IS) department in the University of Utah’s David Eccles School of Business. This is an exciting time for data analysis! The field has undergone a revolution in the last 15 years with increases in computing power and the availability of “big data” from web-based systems of data collection. “Data science” is the umbrella term that describes the result of this revolution—a new discipline at the intersection of many traditional fields such as … Read more →

120

# Lokal lagring og bruk av sensitive data

2017-08-31

Veiledning i installasjon og bruk av VeraCrypt for sikker lagring og sletting av data ved Senter for klinisk dokumentajson og evaluering (SKDE), Helse Nord RHF. […] Analyse av sensitive og tidsavgrensede data inngår som en del av de praktiske oppgaven SKDE har. Egenskapene til slike data vil typisk være at de kun skal nås av en begrenset og definert gruppe av brukere samt at de effektivt må kunne slettes ved gyldighetsperiodens utløp. Dette gir noen spesielle utfordringer når brukere samtidig skal kunne arbeide effektiv og dele slike data seg imellom. Typisk for analysevirksomhet er også at … Read more →

121

# Probability and Statistics

2017-08-28

These are the lecture notes for POS 5737, the introductory probability and statistics class in the graduate program in political science at Florida State University. […] These are the notes for POS 5737, taught in the Department of Political Science at Florida State University. They freely borrow from several well-known textbooks, including those by Wackerly, Mendenhall, and Scheaffer (2008), DeGroot and Schervish (2012), and Casella and Berger (2002). They also borrow from my own notes as a graduate student when I was taught by Kevin Clarke. Kevin was kind enough to provide his own old … Read more →

122

# R tips: 16 HOWTO’s with examples for data analysts

2017-08-27

R tips: 16 HOWTO’s with examples for data analysts […] … Read more →

123

# ModernDive

2017-08-23

An open-source and fully-reproducible electronic textbook bridging the gap between traditional introductory statistics and data science courses. […] Help! I’m new to R and RStudio and I need to learn about them! However, I’m completely new to coding! What do I do? If you’re asking yourself this question, then you’ve come to the right place! Start with our Introduction for Students. This is version 0.2.0 of ModernDive published on August 02, 2017. For previous versions of ModernDive, see Section 1.4. This book assumes no prerequisites: no algebra, no calculus, and no prior programming/coding … Read more →

124

# R Studio: A 3D Printer for Business Analytics

2017-08-06

This document describes the concept of Mass Customisation as it applies to Business Analytics and provides case study implementations of R Studio […] Good Morning! How are you doing? It’s been great being part of the Analytical Community the last few years hasn’t it? The excitement is everywhere about “big-data”,“data-science”,“MOOCs”. I have been blown away by the talent being attracted into Analytics.One current trend is ‘a shift from a desire to work for bigger name brand companies like Facebook or Google, to more mission-driven organizations attempting to make an impact on society. … Read more →

125

# Data Science in Educational Research

2017-07-30

This is an introduction and tutorial for data science in educational research. … Read more →

126

# Papa’s Three Laws

2017-07-20

This is a selection of a papa’s diary originally posted on my blog. A family’s stories of two children are told. This book is being updated. […] 我家有两个娃。大的是男孩，生于北京，唤作京生; 小的也是男孩，生于德国，唤作德生。 本书讲述的是我和我的朋友们的育儿和家庭故事。 … Read more →

127

# Data Science and Visualizations with R

2017-07-16

Data Science and Visualizations with R […] This is a course on the use of tidyverse packages tidyverse provides a complete suite of modern data-handling tools. It is an essential toolbox for any data scientist using R. The tidyverse package is designed to be easy to install. This course will dive into using tidyverse. It will assume you have already installed r and rstudio and how some familiarity on how to use the rstudio. This book will use the nycflights13 dataset This package contains information about all flights that departed from NYC in 2013: 336,776 flights with 16 variables. To … Read more →

128

2017-07-04

129

# (Very) basic steps to weight a survey sample

2017-07-03

(Very) basic steps to weight a survey sample […] This is an introductory guide to survey weighting. It provides a step-by-step walkthrough of the main procedures and explains the statistical principles behind them. The guide includes R code to implement all stages of survey weighting and reproduces the weighting procedures of the 7th European Social Survey in the UK. This text avoids technical notation and language and is targeted to social scientists with a basic level of statistics and probability theory. Readers without knowledge of R should be able to benefit from this text as it … Read more →

130

# The Unix Workbench

2017-06-29

The Unix Workbench […] Cover Image: A Goldsmith in his Shop by Petrus Christus This work by Sean Kross is licensed CC0. Zero rights … Read more →

131

# Gopnik Guide to Biology

2017-06-26

Bandymas sukurti lengvą biologijos elektroninę knygą. […] Internetas Lietuvos švietimo įrankiams kol kas turėjo mažai įtakos. Vietoje popierinės knygutės atsirado elektroninės knygutės, pasiruošti valstybiniams egzaminams atsirado programėlės. Bet šie įrankiai susiję su kontrolės struktūromis sekti moksleivio progresą ir įvertinti, ar jis teisingai pasiruošė egzaminui. Tikra edukacija prasideda ne nuo pažymių ar atsiskaitymo po 12 metų, o pirminio klausimo - kodėl? Pirminė nuostaba, jog aplinka neatitinka mūsų vidinio realybės modelio pastumia imtis veiksmų išsiaiškinti, kur mes klydome ir … Read more →

132

# Notes

2017-06-20

This is notes from yufree […] 这里的笔记主要来自于公开课笔记与相关教材的读书笔记，主题相对分散，但这些知识应该为当今科研人员的基本技能。 首先科研人员要有一定的数学与统计学功底，这是最最基本的工具学科。微积分、线性代数与数值方法是必须的数学工具，统计学工具则至少明白如何进行统计推断与预测。其余的要看应用，例如数论对密码学而言就是基础。 然后就是编程技能，编程方面首先要熟悉编程的思维方法，例如递归、迭代、条件语句等，也就是知道机器怎么运转。其次就是掌握一门高级语言，例如R、python或matlab，这样你可以快速实现自己的想法。 之后就是模型思维，懂得将实际问题抽象成一个概念问题或统计问题或仿真问 … Read more →

133

# Notes on R for AML100

2017-06-20

Notes on R for the course AML100 at Arizona State University. […] These notes introduce the basics of the programming language R as needed for the course AML100. Notes on RStudio and R Markdown are included in … Read more →

134

# Underlagsrapport för En ännu bättre strålbehandling avseende incidens och prevalens av cancer i Västra Sjukvårdsregionen 2016-2030

2017-06-16

Förutsägelse av framtida förekomst av cancer i Västra Sjukvårdsregionen. […] Rapporten presenteras i tre format, samtliga med samma text- och bildmässiga innehåll men med något olika tekniska lösningar. Om du läser denna rapports HTML-version så når du övriga format via nedladdningsikonen i sidhuvudet (se figur … Read more →

135

# 液体活检口袋书

2017-06-01

Liquid biopsy pocket book (in Chinese), written by Bioinformatics engineers. […] 海普洛斯推出【液体活检口袋书】专栏，对液体活检进行系统、全面的介绍。每周三更新，向大家介绍关于液体活检的一切。 … Read more →

136

# Detecting collusion in goverment procurement contracts

2017-05-26

This publication is the result of five months of work for our Data Product Architecture class project. […] Since 2002, the Mexican Federal government handles most of its procurement biddings through a transactional platform called Compranet. Even though most of the information in the platform is public, authorities and organizations dedicated to fight corruption do not have a technical framework to better allocate their resources into cases. Our project consisted in developing an interactive dashboard for investigators to track particular contracts and to filter out low-risk … Read more →

137

# GuitaR Bookdown

2017-05-25

This is a collection of my favorite songs with guitar chords, produced by bookdown. […] 最真的梦，就是用R语言的bookdown把R代码、作图、数据分析和吉他谱弄到一起。 啥？弄到一起有什么用？ 呃……容我清清脑子想一想…… 越过下面这座山丘，却发现无人等候…… 终会有一天　把心愿完成　带着你飞奔找永恒 [\int_0^\infty e^{-x^2} dx=\frac{\sqrt{\pi}}{2}] 本书的吉他谱，在网页上看不见，只有点击下载pdf才能看见哦。 … Read more →

138

# Föll í R - Dæmi

2017-05-23

Föll í R - Dæmi […] Hér eru dæmi um notkun á föllum sem ég hef skrifað og má finna á GitHub. Þetta eru aðallega föll sem spara mikinn tíma við uppsetningu á algengum töflum fyrir vísindagreinar (á sviði læknavísinda) en eru líka hjálpleg til þess að átta sig á fylgni milli mismunandi breyta í gagnasafninu. Þessi síða er búin til með bookdown. Það er frábær pakki sem tvinnar saman R markdown skrár og setur saman í aðgengilegt html-bókarsnið. Í öllum dæmunum er notast við ‘diabetes’ gagnasettið sem er aðgengilegt frá http://biostat.mc.vanderbilt.edu/wiki/Main/DataSets. The data consist of 19 … Read more →

139

# Egils saga Skalla-Grímssonar

2017-05-15

Egils saga Skalla-Grímssonar […] Texti Egils sögu var afritaður af vefsíðu The Icelandic Saga Database (sótt 15. maí 2017) og útbúinn fyrir birtingu hér með R markdown og bookdown pakkanum í R. Eyþór Björnsson, 15. maí … Read more →

140

# Mastering DFS Analytics

2017-05-12

Mastering DFS Analytics is a data-driven program to improve your daily fantasy sports results. You’ll learn and much more. Written by an applied mathematician, Mastering DFS Analytics will give you contest-tested tools. In addition to the ebook, you get Comments? Questions? @znmeb_dfs on Twitter Mastering DFS Analytics by M. Edward (Ed) Borasky is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Mastering DFS Analytics on … Read more →

141

# Literature thesis: Building a framework for retrieving information on multispecies interactions from published literature

2017-04-28

Literature thesis: Building a framework for retrieving information on multispecies interactions from published literature […] The generation of new global hypothesis, destined to understand our current global biodiversity crisis, requires a large amount of information. Our knowledge in Ecology is principally contained in the form of published articles. This global body of literature holds a significant amount of primary data on species distributions and interactions across a large geographical and temporal scale. In this literature review, I explore the use of different computational tools … Read more →

142

# The Art of Data Science

2017-04-26

The book covers R software development for building data science tools. As the field of data science evolves, it has become clear that software development skills are essential for producing useful data science results and products. You will obtain rigorous training in the R language, including the skills for handling complex data, building R packages and developing custom data visualizations. You will learn modern software development practices to build tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers. Read more →

143

# Tidyverse Cookbook

2017-04-21

Simple cookbook for functions and idioms within the scope of the tidyverse. […] The basic idea of this book is to provide a documentation of the tidyverse written in a solution driven cookbook style. As an extra I would like to provide similar solutions based on base R functionality. Some reasons to write this book: One strength of the tidyverse is that it hides a lot of quirks that base R provides and inherits to many packages that rely on it. This allows to stick to a specific workflow from the point you enter the tidyverse until you leave it. This is why I highly recommend to head your … Read more →

144

# An approach to identify the sources of low-carbon growth for Europe

2017-04-21

This website serves to illustrate the findings of the policy contribution ‘An approach to identify the sources of low-carbon growth for Europe’ and allows a deeper dive into the underlying data. […] This website serves to illustrate the findings of the policy contribution “An approach to identify the sources of low-carbon growth for Europe” (Zachmann 2016) and allows a deeper dive into the underlying data. The website is focused on presenting figures and deliberately only offers short descriptions and interpretations. The research underlying this report has been financially supported by the … Read more →

145

# Social Network Analysis in Education

2017-04-18

This is a course handbook written by Bodong Chen for his SNA course at UMN. […] This site is the course portal of CI5330 - Social Network Analysis in Education, taught by Prof. Bodong Chen at the University of Minnesota in Spring ’17. Content on this site is actively built and refined throughout the semester. This site or book is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Last update: 2017-04-17 … Read more →

146

# Notas sobre Estimación Puntual

2017-04-06

Se desarrolla el tema de estimación puntual para el curso Métodos en Bioestadística I perteneciente al Maestría en Bioestadística de la Universidad Javeriana […] En las siguientes páginas se desarrolla brevemente el tema de estimación puntual. Forma parte de una evaluación para el curso Métodos en Bioestadística I, perteneciente al Maestría en Bioestadística de la Universidad Javeriana. Este trabajo puede usado como una introducción, a manera de notas de clases o como un inicio de colaboración a un escrito más amplio y completo sobre estimación puntual. Cualquier crítica, aporte y/o … Read more →

147

# An approach to identify the sources of low-carbon growth for Europe

2017-04-03

Draft website for the European Climate Foundation […] This website serves to illustrate the findings of the policy contribution “An approach to identify the sources of low-carbon growth for Europe” (Zachmann 2016) and allows a deeper dive into the underlying data. The website is focused on presenting figures and deliberately only offers curt descriptions/interpretations. It is currently structured into five chapters but we plan to extend it when further steps of our analysis become available. The research underlying this report has been financially supported by the European Climate … Read more →

148

2017-03-20

149

# Advances on the analysis on connectivity of Raphia taedigera palm swamps for Central America

2017-03-13

Advances on the analysis on connectivity of Raphia taedigera palm swamps for Central America … Read more →

150

# Data lunch 2feb: The use of Bookdown to write documents and reports

2017-02-03

Data lunch 2feb: The use of Bookdown to write documents and reports […] Make sure you have installed the latest version of R and the Preview Release of RStudio. The following packages should be installed. If you have them already make sure they are updated. The most up to date versions are the “in development” versions from gitHub. Do you have Pandoc installed? RStudio should come along with Pandoc. and latex ? ( if you want to have PDF outputs as well) note that PDF does not allow interactive plots If you do not have latex installed Mac OS X –> MacTeX (http://www.tug.org/mactex/) Linux … Read more →

151

# 기초통계 개념정리

2017-01-29

This is a basic statistics book written by JSKIM. […] This is a basic statistics book written by Jinseob … Read more →

152

# Revealed comparative advantage and network centrality

2017-01-26

Revealed comparative advantage and network centrality … Read more →

153

# A Minimal Book Example

2017-01-08

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). For now, you have to install the development versions of bookdown from Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading #. To compile this example to PDF, you need to install XeLaTeX. 이 예제를 PDF로 컴파일하려면, XeLaTeX을 … Read more →

154

# R aplicado à Biologia: uma introdução descomplicada e divertida!

2017-01-07

Este é o livro ao vivo do blog Cantinho do R […] Este material foi construído com a ajuda de muitas pessoas que acreditam no LEQ e em Ciência Livre. Muito obrigado! Para mais material, visite o Cantinho do R Um prefácio da nova apostila do Cantinho do R Viva! Depois de uma longa demora (pelo menos para quem nos acompanha desde o começo), aqui está a nossa nova apostila do Cantinho do R! :D Se você é um recém chegado, acho que eu tenho que começar explicando aqui o que é, pra que serve e de onde nasceu este material, não é? É disso que se trata este primeiro capítulo. Mas não se preocupe, … Read more →

155

# Do not use averages with Likert scale data

2017-01-05

This is a short overview of why averages don’t work well for evaluating Likert scale or other ordinal-scale data, and what to do instead, with examples using R. While the examples are focused on healthcare surveys, the lessons apply to any use of ordinal scale data. Note: all of the data in this document is fake, created specifically to illustrate particular points. Contact/Twitter: @healthstatsdude PDF version: Website: https://bookdown.org/Rmadillo/likert/ Corrections/Pull requests: https://github.com/Rmadillo/likert Cover image: Gustave Doré, 1863. Illustration 12 for Cervantes’s Don … Read more →

156

# R Programming for Data Science

2016-12-22

The R programming language has become the de facto programming language for data science. Its flexibility, power, sophistication, and expressiveness have made it an invaluable tool for data scientists around the world. This book is about the fundamentals of R programming. You will get started with the basics of the language, learn how to manipulate datasets, how to write functions, and how to debug and optimize code. With the fundamentals provided in this book, you will have a solid foundation on which to build your data science toolbox. Read more →

157

# A list of R conferences and meetings

2016-12-21

A list of R conferences and meetings. […] This site attempts to list R conferences and local useR groups. Please feel free to add any missing group or conference. In particular, most of the associated twitter names are missing. There are currently 263 R user groups and events. To propose a change, just click the pencil icon in the top left hand corner. We also maintain a corresponding list of Data Science conferences and events. The html files for this document live in the docs/ directory of the repository. Travis creates the html files from the .Rmd files and commits them to the docs/ … Read more →

158

# Dengue Forecasting Project

2016-12-15

This is a book that contains experiments and results about the predictions of dengue outtbreaks in Thailand. […] This is a sample book written in Markdown. For now, you have to install the development versions of bookdown from Github: … Read more →

159

# Efficient R programming

2016-11-30

Efficient R Programming is about increasing the amount of work you can do with R in a given amount of time. It’s about both computational and programmer efficiency. […] This is the online version of the O’Reilly book: Efficient R programming. Pull requests and general comments are welcome. Colin Gillespie is Senior lecturer (Associate professor) at Newcastle University, UK. His research interests are high performance statistical computing and Bayesian statistics. He is regularly employed as a consultant by Jumping Rivers and has been teaching R since 2005 at a variety of levels, ranging … Read more →

160

# Interactive Data Visualization (2nd Day)

2016-11-23

Script developed for a workshop at the CUSO doctoral school on the 4th and 5th November 2016. […] This document serves as slides and script for the second day of the workshop Data Visualization taught by Paul C. Bauer and Richard Traunmüller for the Programme doctoral en science politique (PDSPO) (Bern, 4-5 of November 2016). The present material is licensed under a Creative Commons Attribution-ShareAlike License 3.0. Regarding further use of this material contact Paul. Some of the material is inspired by the official shiny tutorial and Plotly for R by Carston Sievert. For potential future … Read more →

161

# ggplot2逆引き集

2016-11-20

これはggplot2逆引き集です。 […] これはQiitaで公開されているggplot2逆引きの記事を集めたものです。今のところ，@kazutanが作成した12本をまとめています。 なにかありましたら，以下のGithubリポジトリのissueもしくはTwitterの@kazutanまでおねがいします。 … Read more →

162

# R Powered Web Applications with Shiny

2016-11-08

R Powered Web Applications with Shiny […] This is a book version, transcribed by Andrew Clark using RStudio’s bookdown package, of an extensive blog post by Zev Ross. The book version has the advantage of being available in several formats, more easily updated and downloadable. However, for an interactive version refer to the above mentioned blog … Read more →

163

# Premier League Annual

2016-10-29

Premier League Annual […] This is an ‘on the fly’ annual based on the 2016⁄17 Premier League season, updated weekly with charts, tables, highlight videos and trivia related to the games played. Each chapter features static visualizations relevant to the games that week. Greatly extended, fully-interactive and constantly updated versions can be found on the accompanying dashboard site Additional data is available at the Premier League Web site Most of the underlying data is unofficial, unguaranteed error-free and available for a million dollars. There is also likely to be use of James … Read more →

164

# GerminaQuant

2016-10-11

A guide for analisis of germination variables and usage of GerminaQuant. […] GerminaQuant allows make the calculation of the germination variables incredibly easy in an interactive applications build in R (R Core Team 2016), based in GerminaR and Shiny (Chang et al. 2016) package. GerminaQuant app is reactive!. Outputs change instantly as users modify inputs, without requiring a reload the app. The principal features of the application allow calculate the princiapal germination Variables, statistical analysis and easy way to plot the results. … Read more →

165

# Spark Social Science Manual

2016-10-03

Spark Social Science Manual […] Let the sample mean, (\hat{\mu}), be the parameter estimate for our mean parameter (\mu) and the null hypothesis of the t-test be (H_0): (/mu = 0). The test statistic is given by (\hat{\mu} / (\hat{\sigma} / \sqrt{n})). Remember that the p-value is determined by the test statistic and the t-distribution with ((n – 2)) degrees of freedom in this case. By the Central Limit Theorem, (\sqrt{n}*(\hat{\mu}-\mu) \rightarrow N(0,\sigma^2)) as (n \rightarrow \infty), or written differently as (\hat{\mu} \rightarrow \mu + \frac{\sigma}{\sqrt{n}}N(0,1)) … Read more →

166

# Multivariate Analysis with Optimal Scaling

2016-09-28

In 1980 members of the Department of Data Theory at the University of Leiden taught a post-doctoral course in Nonlinear Multivariate Analysis. The course content was sort-of-published, in Dutch, as Gifi (1980). The course was repeated in 1981, and this time the sort-of-published version (Gifi (1981)) was in English. The preface gives some details about the author. The text is the joint product of the members of the Department of Data Theory of the Faculty of Social Sciences, University of Leiden. ‘Albert Gifi’ is their ‘nom de plume’. The portrait, however, of Albert Gifi shown here, is that … Read more →

167

# Econ 215 Notes

2016-09-26

Lecture notes for my introduction to statistics class at University of Nebraska-Lincoln. […] This is supposed to be your first course in statistics. So the goal is to give you an overview of what statistics is, why it is a powerful thing to know, how you can use it to make informed decision or understand “numbers speak” people throw around in the news. At the end of this class, I hope: 1- You understand the importance of statistics; 2- You can better appreciate the numbers you get from the news; 3- You can perform your own analysis to inform yourself, and your collaborators. The explosion … Read more →

168

# Exploratory Data Analysis with R

2016-09-14

This book covers the essential exploratory techniques for summarizing data with R. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data you have. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing informative data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data. Read more →

169

# Getting used to R, RStudio, and R Markdown

2016-09-13

An introduction into using R, RStudio, and R Markdown for new users […] In the HTML version of this book, you can also download the PDF version of the book by clicking on PDF button in the top toolbar of the page. HTML is the preferred format but the PDF format may be preferred for some readers. Links to the different GIFs directly found in the HTML version are provided in the PDF version. This resource is designed to provide new users to R, RStudio, and R Markdown with the introductory steps needed to begin their own reproducible research. A review of many of the common R errors … Read more →

170

# Руководство по data.table

2016-09-12

Руководство по пакету data.table: перевод виньеток, справочная иформация. […] Вступление Данное руководство содержит переводы всех виньеток по пакету data.table. Все, кроме последней, переведены с версий от июня 2015 г.; последняя - с версии от апреля 2016 г. Переводы будут актуализироваться, также планируется добавить другие материалы. … Read more →

171

# Principles of Econometrics with R

2016-09-01

This is a beginner’s guide to applied econometrics using the free statistics software R. […] … Read more →

172

# Chess Encounters

2016-08-10

Chess Encounters […] … Read more →

173

2016-07-31

174

# useR2016 Conference Videos

2016-07-19

Chart, interactive table and a selection of videos from the useR2016 conference […] This acts as a repository for some of my favourite video talks from the recent useR2016 conference along with the ability to view any of the offerings via a clickable table. It is probably not the most effective of presentation but is a trial run for creating and deploying interactive books to bookdown.org Andrew Clark is an independent R developer based in North Vancouver He has for many years supplied statistical sports data on the web but with the interactive opportunities arising from the shiny framework … Read more →

175

# Scalable Machine Learning and Data Science with Microsoft R Server and Spark

2016-06-01

These are (tentatively) rough notes showcasing some tips on conducting large scale data analysis with R, Spark, and Microsoft R Server. The focus is primarily on machine learning with Azure HDInsight platform, but review other in-memory, large-scale data analysis platforms, such as R Services with SQL Server 2016, and discuss how to utilize BI tools such as PowerBI and Shiny for dynamic reporting, and report generation. Read more →

176

# Shiny Tutorial

2016-05-25

This is a shiny tutorial. […] Some basic knowlege about the R lanuage is requred. It would be helpful if you have some basic knowlege about HTML, CSS and javascript, but they are not … Read more →

177

# Backtesting Strategies with R

2016-05-06

Backtesting strategies with R […] This book is designed to not only produce statistics on many of the most common technical patterns in the stock market, but to show actual trades in such scenarios. Test a strategy; reject if results are not promising Apply a range of parameters to strategies for optimization Attempt to kill any strategy that looks promising. Let me explain that last one a bit. Just because you may find a strategy that seems to outperform the market, have good profit and low drawdown this doesn’t mean you’ve found a strategy to put to work. On the contrary, you must work to … Read more →

178

# Praktiskā biometrija

2016-04-19

Piemēri darbā ar programmu R, lai risinātu statistikas problēmas bioloģijā. […] Praktiskā biometrija Šī grāmata ir mans mēģinājums samērā vieglā formā ar minimālu teorijas materiālu sniegt praktiskus padomus statistisko analīžu veikšanā biologiem. Tā kā uzsvars ir likts uz vārdu ‘’praktiski’’, tad lielāko grāmatas daļu sastāda piemēri tam, kā veikt katru no apskatītajiem statistiskajiem testiem. Plašāka teorētiskā pamatojuma iegūšanai noderēs citu autoru darbi. Nenoliedzami nopietnākais darbs latviešu valodā biometrijas jomā ir jāmin Liepa (1974) grāmata, angļu valodā tas būtu kāds no … Read more →

179

# A Minimal Book Example

2016-04-12

This is a minimal example of using the bookdown package to write a book. The output format for this example is bookdown::gitbook. […] This is a sample book written in Markdown. You can use anything that Pandoc’s Markdown supports, e.g., a math equation (a^2 + b^2 = c^2). For now, you have to install the development versions of bookdown from Github: Remember each Rmd file contains one and only one chapter, and a chapter is defined by the first-level heading … Read more →

180

# Block Relaxation Methods in Statistics

2016-04-01

The book discusses block relaxation, alternating least squares, augmentation, and majorization algorithms to minimize loss functions, with applications in statistics, multivariate analysis, and multidimensional scaling. […] Many recent algorithms in computational statistics are variations on a common theme. In this book we discuss four such classes of algorithms. Or, more precisely, we discuss a single large class of algorithms, and we show how various well-known classes of statistical algorithms fit into this common framework. The types of algorithms we consider are, in logical order, … Read more →

181

# APL in R

2016-04-01

R versions of the array manipulation functions of APL are presented. We do not translate the system functions or other parts of the runtime. Also, the current version has does not have the nested arrays of APL2. […] APL was introduced by Iverson (1962). It is an array language, with many functions to manipulate multidimensional arrays. R also has multidimensional arrays, but not as many functions to work with them. In R there are no scalars, there are vectors of length one. For a vector x in R we have dim(x) equal to NULL and length(x) > 0. For an array, including a matrix, we have … Read more →

182

# 16S rRNA analysis

2018-08-14*

Documentation describing my analyses of 16S rRNA sequencing data. […] My name is Rachael Lappan, and I am a PhD candidate at the University of Western Australia. The core of my PhD work is the Perth Otitis Media Microbiome (biOMe) study, where I work on the upper respiratory tract microbiome in children with recurrent acute otitis media (middle ear infections). The first stage of this research involved characterising the microbiome (by 16S rRNA gene sequencing) on samples from children with ear infections compared with samples from seemingly resistant healthy controls. The paper can be … Read more →

183

# Advanced R

2018-08-14*

This is the website for work-in-progress 2nd edition of “Advanced R”, a book in Chapman & Hall’s R Series. The book is designed primarily for R users who want to improve their programming skills and understanding of the language. It should also be useful for programmers coming to R from other languages, as it explains some of R’s quirks and shows how some parts that seem horrible do have a positive side. This edition is a work in progress. If you’re looking for the electronic version of the 1st edition, you can find it online at http://adv-r.had.co.nz/. You may also be interested in: “R for … Read more →

184

2018-08-14*

185

# Circular Visualization in R

2018-08-14*

This book provides a comprehensive overview of implementing circular visualization in R by cirlize package, espeically focusing on visualizaing high dimentional genomic data and revealing complex relationships by Chord diagram. […] This is the documentation of the circlize package. Examples in the book are generated under version 0.4.2. If you use circlize in your publications, I would be appreciated if you can cite: Gu, Z. (2014) circlize implements and enhances circular visualization in R. Bioinformatics. DOI: 10.1093/bioinformatics/btu393 … Read more →

186

# CookDown

2018-08-14*

A collection of recipes. […] This is a collection of recipes written in Bookdown. Feel free to … Read more →

187

# Data Science Live Book

2018-08-14*

An intuitive and practical approach to data analysis, data preparation and machine learning, suitable for all ages! […] This book is now available at Amazon. Check it out! 📗 🚀. Link to the black & white version, also available on full-color. It can be shipped to over 100 countries. 🌎 The book will facilitate the understanding of common issues when data analysis and machine learning are done. Building a predictive model is as difficult as one line of R code: That’s it. But, data has its dirtiness in practice. We need to sculp it, just like an artist does, to expose its information in order … Read more →

188

# Data Visualization

2018-08-14*

A practical introduction. […] Forthcoming, Princeton University Press. Incomplete draft. This version: 2018-04-25. You should look at your data. Graphs and charts let you explore and learn about the structure of the information you collect. Good data visualizations also make it easier to communicate your ideas and findings to other people. Beyond that, producing effective plots from your own data is the best way to develop a good eye for reading and understanding graphs—good and bad—made by others, whether presented in research articles, business slide decks, public policy advocacy, or … Read more →

189

# Forecasting: Principles and Practice

2018-08-14*

2nd edition […] Welcome to our online textbook on forecasting. This textbook is intended to provide a comprehensive introduction to forecasting methods and to present enough information about each method for readers to be able to use them sensibly. We don’t attempt to give a thorough discussion of the theoretical details behind each method, although the references at the end of each chapter will fill in many of those details. The book is written for three audiences: (1) people finding themselves doing forecasting in business when they may not have had any formal training in the area; … Read more →

190

# Fundamentals of Data Visualization

2018-08-14*

A guide to making visualizations that accurately reflect the data, tell a story, and look professional. […] This is an online preview of the book “Fundamentals of Data Visualization” to be published with O’Reilly Media, Inc. Completed chapters will be posted here as they become available. The book is meant as a guide to making visualizations that accurately reflect the data, tell a story, and look professional. It has grown out of my experience of working with students and postdocs in my laboratory on thousands of data visualizations. Over the years, I have noticed that the same issues … Read more →

191

# R for Data Science

2018-08-14*

This book will teach you how to do data science with R: You’ll learn how to get your data into R, get it into the most useful structure, transform it, visualise it and model it. In this book, you will find a practicum of skills for data science. Just as a chemist learns how to clean test tubes and stock a lab, you’ll learn how to clean data and draw plots—and many other things besides. These are the skills that allow data science to happen, and here you will find the best practices for doing each of these things with R. You’ll learn how to use the grammar of graphics, literate programming, and reproducible research to save time. You’ll also learn how to manage cognitive resources to facilitate discoveries when wrangling, visualising, and exploring data. Read more →

192

# R for Data Science Solutions

2018-08-14*

This contains the solutions to the exercises in the book, R for Data Science, by Garrett Grolemund and Hadley Wickham. […] This contains solutions to the exercise in R for Data Science, byn Hadley Wickham and Garret Grolemund (Wickham and Grolemund 2017). The website for that book is r4ds.had.co.nz, and a physical copy is published by O’Reilly and available from amazon. This work is licensed under a Creative Commons Attribution 4.0 International License Wickham, Hadley, and Garrett Grolemund. 2017. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. 1st ed. O’Reilly … Read more →

193

# edav.info/

2018-08-14*

Everything you need to do well in EDAV […] Students often want to know how they can excel in a course. The standard answer given is usually something like: Just read the syllabus, focus on the topics discussed therein, and be able to understand their nuances. — Typical Prof This answer is often given after a quick sigh and delivered in a surprisingly condescending tone. We don’t like this answer. Our answer is to provide you with edav.info/. This site is one of the best ways to help you with this course. We hope that you find the confidence to dive in and explore this resource and its … Read more →

194

# plotly for R

2018-08-14*

An overview of the R package plotly […] This website explains and partially documents the R package plotly, a high-level interface to the open source JavaScript graphing library plotly.js (which powers plot.ly). The R package already has numerous examples and documentation on https://plot.ly/r and https://plot.ly/ggplot2, but this website provides more of a cohesive narrative to help explain fundamental concepts and recent developments. By reading from start to finish, readers new to R and plotly should be able to get up and running fairly quickly. That being said, advanced R and plotly … Read more →

195

2018-08-14*

196