DATA9000 Data Analytics Assignment - Dublin Institute of Technology, Ireland
Assignment Brief: The assignment requires the application of the course work to a business problem that you encounter in the workplace. Any area of the course covered may be used.
Some suggestions:
- Forecasting of sales data
- Classification analysis to classify customers
- Data quality report on data set
The above suggestions are indicative of the type of project but feel free to go outside these suggestions. The following table gives the layout required for the assignment.
Assignment Format:
I. Description of Business Issue
II. Explanation of technique(s)
III. Application of Technique
IV. Analysis
V. Lessons learned & recommendations
VI. Overall presentation of report
VII. A personal reflection on the assignment
The question is answered using the IBM SPSS tool and uses the following steps.
1. Initial uploading to SPSS - Examination and exploration of the dataset in variable view.
2. Initial frequency table exploration of the complete dataset.
3. Dataset cleaning - Identification and removal of outliers.
4. Secondary frequency table exploration of the cleaned dataset.
5. Initial box-plot graphical comparison of the dataset.
6. Binning of the data to allow for crosstab and chi squared testing.
7. Correlation table
8. Data modelling
9. Conclusion
Note - Use any database as you want, SPSS work. Total 4000 words including loads of pictures with box-plots etc.