Machine Learning
Supervised Learning : Predictive modeling, linear algebra, classification; binary, mulitclass, regression:continous values, knn, logistic regression
feature vector ; qualitative : categorical data - ordinal data ; nominal data - one hot encoding ;
quantitative : numerical; continous, discrete
Unsupervised Learning: structure/representing data
Reinforcement Learning
Labeled data
model training
python Packages
scikit learn
stats model
matplotlib
numpy : numerical computation
scipy
sklearn
pandas
seaborn
exploratory data analysis
model training
precision and recall
loss and validation set
model evaluation procedures: train and test on entire dataset; evaluation metrics;
classification and prediction accuracy
training accuracy; overfit; underfit
decision boundary
train test split ; random state with 4 ; train, test, validation dataset
ramdom sampler
training and testing acuracy
feature selection
confusion matrix
model selection feature selection (cross validation for parameter tuning)
model complexity
predictions of out of sample data
k fold cross validation
ROC Curve and AUC
parameter tuning
optimal tuning parameters
bias and variance
cross valu score
scikit learn grid search and randomize search cross validate one hot encoding
models
linear regression; intercept and coefficient
knn
bayes classification
MLOPS
References: https://www.pinecone.io/learn/k-nearest-neighbor/
Last updated
Was this helpful?