EDA

Descrição

Bsc Hons Data Mining Mapa Mental sobre EDA, criado por Steve Hiscock em 15-12-2013.
Steve Hiscock
Mapa Mental por Steve Hiscock, atualizado more than 1 year ago
Steve Hiscock
Criado por Steve Hiscock aproximadamente 12 anos atrás
37
0

Resumo de Recurso

EDA
  1. Data Granularity - Levels in the data. ie Time, Years, Months, Weeks, Days, Hours
    1. Consistency - Dates 01/01/2000 or 1/1/00
      1. Corruption and Accuracy - System generated problems / Human errors / Out of date
        1. Data Duplication
          1. Missing Data
            1. SOLUTIONS
              1. capitalisation (transform all)
                1. Combine or concatenations of variables
                  1. Careful use of fomats
                    1. Removals of unwanted characters
                      1. Exclusion
                      2. consistent units
                        1. Add system checks
                          1. Reduce the variable types
                          2. Data Types / Model Roles

                            Anexos:

                            1. Categorise Data
                              1. Discrete Data
                                1. Gender
                                  1. Make of car
                                    1. Number of cars
                                      1. Data that can only take certain values.
                                      2. Continuous Data
                                        1. Data that can take any value (within a range)
                                          1. Bank balances
                                            1. Measurements
                                              1. Dates
                                            2. Data Levels

                                              Anexos:

                                              Semelhante

                                              Chapter 19 Key Terms
                                              Monica Holloway
                                              Data Warehousing and Mining
                                              i7752068
                                              Insurance Policy Advisor
                                              Sufiah Takeisu
                                              Data Mining Part 1
                                              Kim Graff
                                              Minería de Datos.
                                              Marcos Soledispa
                                              Machine Learning
                                              Alberto Ochoa
                                              Data Mining from Big Data 4V-s
                                              Prohor Leykin
                                              Data Mining Process
                                              Steve Hiscock
                                              Data Mining Tasks
                                              Steve Hiscock
                                              Model Roles
                                              Steve Hiscock
                                              Distribution Types
                                              Steve Hiscock