project-proposal-2024

DataEase: Simplifying Data Preparation for Machine Learning Tasks

Abstract

DataEase is your all-in-one dataset preparation solution built for both new and seasoned data scientists, revolutionizing the initial stages of working with machine learning. This tool makes data cleansing, advanced preprocessing, and integration less complicated and more manageable for everyone, without the need for extensive technical knowledge. With the goal of minimizing the time and work required to transform raw data into a form that is ready for analysis and model training, DataEase does it with ease. Focusing on reliability, scalability, and maintainability, it is dedicated to improving the most time consuming dataset preparation stage, which will ultimately promote improved ML model development.

Author

Name: Manvi Narang

Student number: 47324804

Functionality

image

Scope

The primary aim of the MVP is to highlight the implementation of DataEase’s key features.

Quality Attributes

To make sure DataEase is an adaptive and useful tool, we prioritize scalability, maintainability and reliability. This ensures that DataEase can handle increasing data volumes and complexity, adapting to users’ evolving needs as well as allows new updates and bug fixes to make sure it is adaptive. Consistent as well as accurate data preparation is one of the topmost priority for DataEase, fostering user trust.

If tradeoffs were necessary, Reliability would take top priority for DataEase considering that on this attribute, the entire goal and functionality of the project relies. This would be followed by Maintainability and then scalability.

Evaluation

1. Scalability: Scalability can be evaluated as-

2. Maintainability: Maintainability can be evaluated as follows-

3. Reliability: Evaluating Reliability would involve-