Data pre-processing

Description

Mind Map on Data pre-processing, created by Saravanakumar on 18/02/2015.
Saravanakumar
Mind Map by Saravanakumar, updated more than 1 year ago
Saravanakumar
Created by Saravanakumar about 9 years ago
27
0

Resource summary

Data pre-processing
  1. Data cleaning
    1. Missing values
      1. Ignore the tuple
        1. Fill in the missing value manually
          1. Use a global constant to fill in the missing value
            1. Use the attribute mean to fill in the missing value
              1. Use the attribute mean for all samples belonging to the same class as the given tuple
                1. Use the most probable value to fill in the missing value
                  1. Use the most probable value to fill in the missing value
                  2. Noisy data

                    Annotations:

                    •    Noise is a random error or variance in a measured variable.    
                    1. Binning
                      1. Regression
                      2. Clustering
                      3. Data cleaning as a process
                        1. Data integration and transformation
                          1. Data Integration
                            1. Data Transformation
                              1. Smoothing
                                1. Aggregation
                                  1. Generalization
                                    1. Normalization
                                      1. Attribute construction
                                  2. Data reduction
                                    1. Data cube aggregation
                                      1. Attributes subset selection
                                        1. Stepwise forward selection
                                          1. Stepwise backward elimination:
                                            1. Combination of forward selection and backward elimination
                                              1. Decision tree induction
                                              2. Dimensionality reduction
                                                1. Numerosity reduction
                                                  1. Data discretization and concept hierarchy generation
                                                  2. Why Preprocess the Data?
                                                    1. Data Discretization and Concept Hierarchy Generation
                                                      1. Discretization and Concept Hierarchy Generation for Numerical Data
                                                        1. Concept Hierarchy Generation for Categorical Data
                                                        2. Descriptive Data Summarization
                                                          1. Measuring the Central Tendency
                                                            1. Measuring the Dispersion of Data
                                                              1. Graphic Displays of Basic Descriptive Data Summaries
                                                              Show full summary Hide full summary

                                                              Similar

                                                              C3 - Formulae to learn
                                                              Tech Wilkinson
                                                              IMAGS Employment Examination for Applicants
                                                              mike_101290
                                                              PRACTICA EL SPEAKING DEL FIRST
                                                              Diana GE
                                                              Fractions
                                                              MsHeltonReads
                                                              Spanish Subjunctive
                                                              MrAbels
                                                              Physics: Energy resources and energy transfer
                                                              katgads
                                                              The Many Conjugations of Spanish! Wow!
                                                              hannahkathryn5
                                                              AQA GCSE Physics Unit 3 Mindmap
                                                              Gabi Germain
                                                              The Circulatory System
                                                              Shane Buckley
                                                              Diseño de Software
                                                              Verny Fernandez
                                                              TISSUE TYPES
                                                              Missi Shoup