Course curriculum

    1. Introduction

    2. Meet the Instructor

    3. Course Outline

    4. Starter Code

    1. Theoretical Concepts of Text Representation

    2. Structuring One Document Corpus

    3. Structuring a Multiple Document Corpus

    4. Setting Parameters

    5. Using TF-IDF Representation

    6. Reading Data from a Labeled Dataset

    7. Using Textual Dataset from UCI Repository

    1. Machine Learning Overview

    2. K-Nearest Neighbors Classifier

    3. Naive Bayes Classifier

    4. Decision Tree Classifier

    5. Linear Classifier

    6. Concluding Remarks on Classifiers

    7. Classifiers Implementation with Default Settings

    8. Classifiers with Different Parameter Settings

    9. Classification with a UCI Repository Dataset

    1. Introduction to Clustering

    2. K-means Clustering

    3. Implementing Partitional Clustering

    4. Agglomerative Clustering with Default Settings

    5. Agglomerative Clustering with Parameters

    6. Clustering UCI Repository Dataset

    1. Cross Validation

    2. Validation

    3. K-Fold Cross Validation

    4. Leave One Out Validation

    5. Classifiers Evaluation

    6. Predictive Accuracy of KNN using KFold

    7. Precision, Recall and F1-measure

    8. Confusion Matrix

    9. Putting it all Together

    10. Clustering Evaluation Techniques

    11. Implementing Clustering Evaluation

    1. Text Normalization

    2. Lowercase, Whitespaces, Punctuations

    3. Removing Stopwords

    4. Stemming and Lemmatization

    5. Regular Expressions

    6. Applying Regular Expressions

    7. Parts-of-speech Tagging

    8. Data Acquisition

    9. Text Segmentation and Tokenization

About this course

  • $9.99
  • 66 lessons
  • 5.5 hours of video content

Discover your potential, starting today