The area under the curve is also increasing on the validation set and indicates that survived passengers 84.4671% more often are classified as belonging to class of survival than those passenger who have died. Finally, I will estimate such classification method as the Naïve Bayes Classifier that is based on determining probabilities for the outcomes. Implementing the full model,
I receive some evidence of overfitting for the full model due to the high dimensions of the model. As implemented in a previous method, I will create a reduced model with the variables that were identified as the most significant for the reduced logistic regression model:
Model Error Rate (training) Error Rate (validation)
Full model 25.5102