Decision Tree Classification of Diabetes among the Pima Indian Community in R using mlr

In my last post I conducted EDA on the Pima Indians dataset to get it ready for a suite of Machine Learning techniques. This second post starts applying them. Taking one step at a time, each upcoming post will cover a single Machine Learning technique and take as in-depth a look as I can manage at how to apply it. Let's start with Decision Trees!

Diabetes Among The Pima Indians: An Exploratory Analysis

In this post we will explore the Pima Indian dataset from the UCI repository. This post aims to showcase different ways of thinking about your data. Most novices to data science rush into data preprocessing without exploring the data properly. The data cleaning stage can be subjective at times, and here I offer my own views and opinions on this dataset.

