Establishing Data Mining Goals
- Get link
- X
- Other Apps
Establishing Data Mining Goals:
- Identify key questions to be answered.
- Consider costs, benefits, and expected accuracy.
- Address cost-benefit trade-offs for desired accuracy levels.
Selecting Data:
- Quality of data is crucial for data mining outcomes.
- Availability varies; may require new data collection initiatives.
- Type, size, and frequency of data collection impact mining costs.
Preprocessing Data:
- Raw data may be messy with errors or missing information.
- Identify and remove irrelevant attributes.
- Address errors, ensuring data integrity.
- Develop methods to handle missing data systematically or randomly.
Transforming Data:
- Determine appropriate data format and reduce attributes.
- Use data reduction algorithms (e.g., Principal Component Analysis).
- Transform variables for better representation.
- Convert continuous variables to categorical for capturing non-linearities.
Storing Data:
- Store transformed data conducive for data mining.
- Allow unrestricted read/write access for data scientists.
- Ensure efficient data storage and prioritize data safety and privacy.
Mining Data:
- Apply data mining methods, including parametric, non-parametric, and machine-learning algorithms.
- Start with data visualization for a preliminary understanding of trends.
Evaluating Mining Results:
- Formally evaluate results, testing predictive capabilities on observed data (in-sample forecast).
- Share results with key stakeholders for feedback.
- Incorporate feedback into subsequent iterations for continuous improvement.
- Data mining and result evaluation form an iterative process.
- Get link
- X
- Other Apps
Comments
Post a Comment