
UC Irvine Machine Learning Repository: Haberman's Survival Dataset

The UC Irvine Machine Learning Repository contains many datasets for conducting computer science research. One such dataset is the Haberman’s Survival dataset, available at http://archive.ics.uci.edu/ml/datasets/Haberman’s+Survival . The file “haberman.data” contains survival data for breast cancer patients in comma-separated value format. The first field is the patient’s age at the time of surgery, the second field is the year of the surgery, the third field is the number of positive axillary nodes detected, and the fourth field is the survival status. The survival status is 1 if the patient survived 5 years or longer and 2 if the patient died within 5 years. Write a Java console or GUI-based application program that reads the CSV file and calculates the average number of positive axillary nodes detected for patients that survived 5 years or longer, and the average number of positive axillary nodes detected for patients that died within 5 years. A significant difference between the two averages suggests whether or not the number of positive axillary nodes detected can be used to predict survival time. Your program should ignore the age and year fields for each record.

Please write the program in Java.

Expert Answer
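One possible console solution is sketched below; it is an illustration under stated assumptions, not the original posted answer. It assumes "haberman.data" has been downloaded into the working directory (or its path is passed as the first argument), that every non-blank line has the four comma-separated fields described in the question, and that the file has no header row. The class name HabermanAverages and the helper names Result and summarize are made up for this example.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

public class HabermanAverages {

    /** Running node totals and record counts for the two survival classes. */
    static final class Result {
        long survivedNodes;
        long survivedCount;
        long diedNodes;
        long diedCount;

        /** Average nodes for status 1; NaN if no such records were seen. */
        double survivedAverage() {
            return survivedCount == 0 ? Double.NaN : (double) survivedNodes / survivedCount;
        }

        /** Average nodes for status 2; NaN if no such records were seen. */
        double diedAverage() {
            return diedCount == 0 ? Double.NaN : (double) diedNodes / diedCount;
        }
    }

    /** Accumulates node counts per survival status, ignoring the age and year fields. */
    static Result summarize(Iterable<String> lines) {
        Result result = new Result();
        for (String raw : lines) {
            String line = raw.trim();
            if (line.isEmpty()) {
                continue; // tolerate blank lines, e.g. a trailing newline
            }
            String[] fields = line.split(",");
            if (fields.length < 4) {
                throw new IllegalArgumentException("Expected 4 fields: " + line);
            }
            // fields[0] (age) and fields[1] (year of surgery) are deliberately unused.
            int nodes = Integer.parseInt(fields[2].trim());
            int status = Integer.parseInt(fields[3].trim());
            if (status == 1) {          // survived 5 years or longer
                result.survivedNodes += nodes;
                result.survivedCount++;
            } else if (status == 2) {   // died within 5 years
                result.diedNodes += nodes;
                result.diedCount++;
            } else {
                throw new IllegalArgumentException("Unknown survival status: " + line);
            }
        }
        return result;
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the data file sits in the working directory unless a path is given.
        String fileName = args.length > 0 ? args[0] : "haberman.data";
        List<String> lines = Files.readAllLines(Paths.get(fileName), StandardCharsets.UTF_8);
        Result result = summarize(lines);

        System.out.printf("Survived 5 years or longer: %d patients, average positive nodes = %.2f%n",
                result.survivedCount, result.survivedAverage());
        System.out.printf("Died within 5 years: %d patients, average positive nodes = %.2f%n",
                result.diedCount, result.diedAverage());
    }
}
```

To try it, compile with "javac HabermanAverages.java" and run "java HabermanAverages haberman.data". Keeping the parsing in summarize, separate from the file reading in main, makes it easy to check the averaging logic against a few hand-written records before running it on the full dataset.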

