You have been assigned to fit a separate logistic regression model for each of 100 countries. All of the data is currently stored in a PostgreSQL database. Which tool or library would let you produce these models with the least effort?
You have created a scatterplot of two continuous variables for 2,000 records. You want to add a line to the scatterplot to check whether the relationship between the variables is linear. Which function would best address this need?
Refer to the exhibit, which shows pairwise counts for items purchased together. Consider the following association rules:
- Milk -> Eggs
- Eggs -> Milk
- Bread -> Milk
- Milk -> Bread
Which rule has a confidence higher than 70%?
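As a reminder of how rule confidence is computed from pairwise counts, here is a minimal Python sketch. The confidence of a rule A -> B is the number of transactions containing both A and B divided by the number containing A, so A -> B and B -> A generally differ. The counts below are hypothetical placeholders, not the values from the exhibit, and the names `item_counts`, `pair_counts`, and `rule_confidence` are assumptions made for illustration.

```python
# Hypothetical counts for illustration only; NOT the exhibit's values.
# item_counts[x]: number of transactions containing item x.
item_counts = {"Milk": 50, "Eggs": 40, "Bread": 80}

# pair_counts[{x, y}]: number of transactions containing both x and y.
pair_counts = {
    frozenset(("Milk", "Eggs")): 30,
    frozenset(("Milk", "Bread")): 40,
}


def rule_confidence(antecedent, consequent):
    """Return confidence(antecedent -> consequent).

    confidence(A -> B) = count(A and B) / count(A)
    """
    both = pair_counts[frozenset((antecedent, consequent))]
    return both / item_counts[antecedent]


if __name__ == "__main__":
    rules = [("Milk", "Eggs"), ("Eggs", "Milk"),
             ("Bread", "Milk"), ("Milk", "Bread")]
    for a, b in rules:
        print(f"{a} -> {b}: {rule_confidence(a, b):.0%}")
```

To answer the question itself, substitute the counts from the exhibit and compare each result against the 70% threshold.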
You build a decision tree to classify five different types of customers based on their browsing history, using a sample of 500 customers. The resulting decision tree has 17 layers, and one of the leaf nodes contains only three customers. What do you conclude?