Introduction
I have included some useful reference materials here so that I can always get to this information quickly. These are PDF files that you can download.
- The Periodic Table of Alteryx Tools – Front Side
- The Periodic Table of Alteryx Tools – Back Side
- Regular Expressions Cheat Sheet
Alteryx Concepts
Tableau Concepts
Online References
- Tableau Mapping
- Parsing Command Line Parameters (thanks to Joe Mako)
Alteryx and R
Here are key links and cheat sheets. All of the cheat sheet information was developed by Alteryx User SydneyF. Look at the end of this page to find an entire data science curriculum based in R from the group called DataCamp.
From Sydney: The R Tool can be used to run any R code from Alteryx Designer. The R tool comes with a few Alteryx-specific R packages that allow you to use the R tool seamlessly in a workflow, or even create an R-based macro. To assist you in these endeavors, we’ve developed the R Tool Cheat Sheet with these functions, which you can download to have as your very own.
- Alteryx R Documentation Page
- A cheat sheet of R functions to use in Alteryx
- AlteryxRCheatSheet
- data-wrangling-cheatsheet
- devtools-cheatsheet
- ggplot2-cheatsheet
- Short-refcard
- YanchangZhao-refcard-data-mining
R-Based Data Science Curriculum
DataCamp Courses
DataCamp firmly believes in educating people to be the best data scientists possible. As such, they allow students to take as many classes as they would like for free while enrolled, and there are a lot to choose from, not only in R but also in Python, SQL, and other languages. Below is a comprehensive list of the classes that were available in January 2017.
Make sure to register for DataCamp.
R Programming
- Introduction to R – Mostly working with data structures like vectors, matrices, factors, data frames, and lists.
- Intermediate R – If/then, loops, functions, the apply family, debugging, working with text via regular expressions and substitutions, working with dates.
- Working With Dates and Times in R – Using the lubridate package.
- Writing Functions in R – Uses the purrr package, which is the “dplyr” of function-writing; the course covers handling errors, arguments, etc., and is a bit more advanced.
- Writing Efficient R Code – Benchmarking/timing, profiling, parallel programming, very advanced stuff.
- Reporting with R Markdown – Those .Rmd files you’re always using.
Reading in, Cleaning, and Manipulating Data
- Importing Data in R Part 1 – Using the readr and data.table packages for reading in data, reading in Excel data, XLConnect with Excel.
- Importing Data in R Part 2 – Importing from databases, using SQL in R, importing data from the web with APIs and JSON, importing data from SAS, STATA, SPSS, etc.
- Cleaning Data in R – Uses the tidyr package to help separate/unite columns, handle messy data, do string and data type conversion, handle missing values and data errors.
- Manipulating data with dplyr – Learning the dplyr and tbls packages for selecting, mutating, filtering, arranging, grouping, summarizing, and aggregating data.
- Introduction to Data – An overview of working with data with some cautionary tales, uses dplyr.
- Data Table Manipulation in R – Another class using dplyr.
- Joining Data in R with dplyr – Another dplyr course; using this package will make you an expert in manipulating data.
- Introduction to Spark in R using sparklyr – Big Data topics.
Working with and Summarizing (Structured) Data
- Exploratory Data Analysis – Charts, tables, counts vs proportions, histograms, boxplots, density plots, numerical and graphical summaries, case study.
- Case Studies in Importing and Cleaning Data in R – A few case studies where data issues crop up, such as warnings, dates, removing redundant data, readxl, data type conversions, separating columns, replacing missing values, removing useless columns, splitting data.
- Exploring Pitch Data with R – An extended case study in baseball using tapply, prop.table, ggplot, and “for” loops to do some analytics.
- Exploratory Data Analysis in R Case Study – Practice with dplyr and ggplot2, intro to the broom package and tidy, looks at UN General Assembly voting history.
- Case study in Credit Risk Modeling – Doesn’t seem like an exclusively financial analytics example, but rather a great general case study.
Visualization
- Data Visualization in R – Overview of plot, lines, points, par, adding text, lines, legend functions, histogram, boxplot, etc., making effective plots and plot layouts.
- Data Visualization with ggplot2 Part 1, Part 2, Part 3 – If you ever want to be a true expert at making ridiculously great and flexible visualizations, this sequence is for you.
- Communicating with Data in the tidyverse – How to make great graphics and presentations with ggplot2 and markdown.
- Data Visualization with lattice – Another package for making great plots.
- Data Visualization with ggvis – Another package for making great plots.
Regression
- Correlation and Regression – Basics.
- Inference for Linear Regression – Variability of coefficients, simulation and bootstrapping of coefficients, assumptions of model and what to do when violated.
- Multiple and Logistic Regression – Handling categorical predictors, interactions, adjusted R2, case study with Italian restaurants.
- Supervised Learning in R with Regression – Machine learning perspective of regression, uses some ggplot2 and tree-based models, one-hot encoding of categorical variables with designTreatmentsZ.
Machine Learning and Data Mining
- Introduction to Machine Learning – Overview and basic algorithms, performance measures and bias/variance, cross-validation.
- Machine Learning with Tree Based Models – rpart, randomForest, gbm.
- Supervised Learning / Classification in R – Nearest neighbor, naive Bayes, logistic regression, classification trees and random forest.
- Unsupervised Learning in R – K-means and hierarchical clustering, dimension reduction with PCA case study, Pokemon data.
- Machine Learning Toolbox – Learning with caret, linear/logistic regression, tuning parameters, pre-processing data, selecting and comparing models.
Time Series and Forecasting
- Introduction to Time Series Analysis – Autoregression and simple moving averages.
- Visualizing Time Series – Mostly using plot(), case study in selecting a stock that improves portfolio.
- ARIMA modeling in R – Seasonal and non-seasonal.
- Forecasting with R – ARIMA, smoothing methods, dynamic harmonic regression, TBATS.
- Manipulating Time Series Data and Case Studies and Forecasting – Using xts and zoo, cases about flights, weather, unemployment, GDP, sports.
- String Manipulation in R with stringr – Formatting characters and strings, regular expressions replacements, case study.
- Introduction to Text Mining and Bag of Words – Basic handling of data, word clouds, distance matrices and dendrograms, n-grams, case study.
- Sentiment Analysis in R – qdap’s sentiment function polarity(), visualizing sentiment, Airbnb reviews.
- Sentiment Analysis the tidy way – Tweets, Shakespeare, TV news, songs.
- Working with Web Data in R – APIs, httr to interact with APIs, JSON, XML, scraping with XPath, CSS web scraping.
Probability and Statistics
- Foundations of Probability – Binomial distribution, simulations, Bayesian statistics, Poisson and geometric distributions, the replicate function.
- Foundations of Inference – Randomization, hypothesis tests, confidence intervals, bootstrapping.
- Inference for Numerical Data – Bootstrapping, t-tests, differences in averages, ANOVA for comparing many averages.
- Statistical Modeling in R Part 1 and Part 2 – Performing statistical models from a different point of view.
Spatial Analysis (geo-spatial statistics)
Network Analysis in R (e.g., social networks)
- igraph – An amazing R package that handles nearly every aspect of network analysis.
Finance in R
- Intro to Finance in R
- Intermediate Finance in R
- Importing and Managing Finance Data
- Intro to Credit Risk Modeling
- Financial Trading
- Equity Valuation
- Introduction to Portfolio Analysis
- Intermediate Portfolio Analysis
- Bond Valuation
- Quantitative Risk Management