EDA

Beschreibung

Bsc Hons Data Mining Mindmap am EDA, erstellt von Steve Hiscock am 15/12/2013.
Steve Hiscock
Mindmap von Steve Hiscock, aktualisiert more than 1 year ago
Steve Hiscock
Erstellt von Steve Hiscock vor etwa 12 Jahre
37
0

Zusammenfassung der Ressource

EDA
  1. Data Granularity - Levels in the data. ie Time, Years, Months, Weeks, Days, Hours
    1. Consistency - Dates 01/01/2000 or 1/1/00
      1. Corruption and Accuracy - System generated problems / Human errors / Out of date
        1. Data Duplication
          1. Missing Data
            1. SOLUTIONS
              1. capitalisation (transform all)
                1. Combine or concatenations of variables
                  1. Careful use of fomats
                    1. Removals of unwanted characters
                      1. Exclusion
                      2. consistent units
                        1. Add system checks
                          1. Reduce the variable types
                          2. Data Types / Model Roles

                            Anlagen:

                            1. Categorise Data
                              1. Discrete Data
                                1. Gender
                                  1. Make of car
                                    1. Number of cars
                                      1. Data that can only take certain values.
                                      2. Continuous Data
                                        1. Data that can take any value (within a range)
                                          1. Bank balances
                                            1. Measurements
                                              1. Dates
                                            2. Data Levels

                                              Anlagen:

                                              Zusammenfassung anzeigen Zusammenfassung ausblenden

                                              ähnlicher Inhalt

                                              Chapter 19 Key Terms
                                              Monica Holloway
                                              Data Warehousing and Mining
                                              i7752068
                                              Insurance Policy Advisor
                                              Sufiah Takeisu
                                              Data Mining Part 1
                                              Kim Graff
                                              Minería de Datos.
                                              Marcos Soledispa
                                              Machine Learning
                                              Alberto Ochoa
                                              Data Mining from Big Data 4V-s
                                              Prohor Leykin
                                              Data Mining Process
                                              Steve Hiscock
                                              Data Mining Tasks
                                              Steve Hiscock
                                              Model Roles
                                              Steve Hiscock
                                              Distribution Types
                                              Steve Hiscock