Skip to content

Latest commit

 

History

History
18 lines (12 loc) · 775 Bytes

README.md

File metadata and controls

18 lines (12 loc) · 775 Bytes

Big data

Eksamensprojekt i Big Data.

Dependencies

  • Numpy
  • Pandas
  • Scikit-learn
  • JUPYTER

Data

Telco Customer Churn: https://www.kaggle.com/blastchar/telco-customer-churn/version/1?select=WA_Fn-UseC_-Telco-Customer-Churn.csv

Characteristics

There are 7043 instances with 21 features of which 12 are strings (more than 2 options of answer), 5 booleans (yes or no answers), and 4 Integer. Here 20 of the features are listed: “customerID, gender, SeniorCitizen, Partner, Dependents, tenure, PhoneService, MultipleLines, InternetService, OnlineSecurity, OnlineBackup, DeviceProtection, TechSupport,StreamingTV, StreamingMovies, Contract, PaperlessBilling, PaymentMethod, MonthlyCharges, TotalCharges”.
The last feature “Churn” is the target value.