<?xml version="1.0" encoding="UTF-8"?>
<mods xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" version="3.1" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
  <titleInfo>
    <title>Introduction to data science</title>
    <subTitle>data analysis and prediction algorithms with R</subTitle>
  </titleInfo>
  <name type="personal">
    <namePart>Irizarry, Rafael A.</namePart>
    <role>
      <roleTerm authority="marcrelator" type="text">creator</roleTerm>
    </role>
    <role>
      <roleTerm type="text">author</roleTerm>
    </role>
  </name>
  <typeOfResource>text</typeOfResource>
  <originInfo>
    <place>
      <placeTerm type="code" authority="marccountry">flu</placeTerm>
    </place>
    <dateIssued encoding="marc">2019</dateIssued>
    <issuance>monographic</issuance>
  </originInfo>
  <language>
    <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
  </language>
  <physicalDescription>
    <form authority="marcform">print</form>
    <extent>pages cm</extent>
  </physicalDescription>
  <abstract>"The book begins by going over the basics of R and the tidyverse. You learn R throughout the book, but in the first part we go over the building blocks needed to keep learning during the rest of the book"--</abstract>
  <tableOfContents>Installing R and RStudio -- Getting started with R and RStudio -- R Basics -- Programming basics -- The tidyverse -- Importing data -- Introduction to data visualization -- ggplot2 -- Visualizing data distributions -- Data visualization in practice -- Data visualization principles -- Robust summaries -- Introduction to statistics with R -- Probability -- Random variables -- Statistical inference -- Statistical models -- Regression -- Linear models -- Association is not causation -- Introduction to data wrangling -- Reshaping data -- Joining tables -- Web scraping -- String processing -- Parsing dates and times -- Text mining -- Introduction to machine learning -- Smoothing -- Cross validation -- The caret package -- Examples of algorithms -- Machine learning in practice -- Large datasets -- Clustering -- Introduction to productivity tools -- Accessing the terminal and installing Git -- Organizing with Unix -- Git and GitHub -- Reproducible projects with RStudio and R markdown.</tableOfContents>
  <note type="statement of responsibility">Rafael A. Irizarry.</note>
  <subject authority="lcsh">
    <topic>R (Computer program language)</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Information visualization</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Data mining</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Statistics</topic>
    <topic>Data processing</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Probabilities</topic>
    <topic>Data processing</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Computer algorithms</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Quantitative research</topic>
  </subject>
  <classification authority="lcc">QA276.45.R3 I75 2019</classification>
  <classification authority="ddc" edition="23">Grad. 005.362  I689 2020</classification>
  <relatedItem type="otherFormat" displayLabel="Online version:">
    <titleInfo>
      <title>Introduction to data science</title>
    </titleInfo>
    <name>
      <namePart>Irizarry, Rafael A.</namePart>
    </name>
    <originInfo>
      <publisher>[Boca Raton] : [CRC Press], [2019].</publisher>
    </originInfo>
    <identifier type="local">(DLC)  2019025161</identifier>
  </relatedItem>
  <identifier type="isbn">9780367357986</identifier>
  <identifier type="lccn">2019025160</identifier>
  <recordInfo>
    <recordContentSource authority="marcorg">LBSOR/DLC</recordContentSource>
    <recordCreationDate encoding="marc">190701</recordCreationDate>
    <recordChangeDate encoding="iso8601">20251027205640.0</recordChangeDate>
    <recordIdentifier source="OSt">21065296</recordIdentifier>
    <languageOfCataloging>
      <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
    </languageOfCataloging>
  </recordInfo>
</mods>
