2 minutes

R Programming 101- Datacamp Beginner

Datacamp Beginner Course Notes

Symbol

Assign is used as <- rather than =.

Arithmetic Operations

Same as other languages.
Modulo is represented by %%.
Logical variables True or False represented by TRUE and FALSE.

Matrix and Vectors

Index starts at 1
Methods:
- rowSum : Find sum across a row.
- colSum
- colnames: Specify column names to a matrix.
- rowNames

Factors

Basically these are used for categorical variables.
If we have a vector with categorical values, we can use factor() to get it’s categorical values.

Types

Nominal : Categorical Variable without implied order.
Ordinal: Categorical Variable with order.
- e.g. [0, 1, 2]
- Defined by passing parameter order = TRUE.
- Also need to pass levels as levels = vector.
- By specifying levels for the input vector, we can specify levels and then later use summary(factors_vector) to get the summary on it’s levels.

DataFrame

Creating:
- data.frame(array1, array2… )
Indexing:
- df$column to get only one column of data.
- Similar indexing as numpy matrices
- Also can use “column names” to get elements by index.
Filtering :
- subset(df, condition)
Sorting:
- order()
- Using order to sort whole dataframe, we need to get index using order() and later use it to sort by indexing into the dataframe as df[sorted_indexes, ]

NOTE: We have a built in dataframe known as mtcars.

Lists

One dimensional arrays which we call lists in python are called vectors here. Lists basically mean collection of different elements: can be of different types as well.
Also can give enumerations for list items using names() which is later used to index list similar to dataframes. e.g. list$name

Backlinks

Data Scientist with R

265 Words

2020-09-22 00:00 +0545

Read other post

← Git Commands R Programming 101- Datacamp Intermediate →