Big Data Fundamentals Test

Description

A quick test for review
Quiz by Carolina Colorado, updated more than 1 year ago
Created by Carolina Colorado over 7 years ago

Resource summary

Question 1

Question
Big Data is dedicated to
Answer
  • Enterprise Apps
  • analysis

Question 2

Question
Requirements of Big Data
Answer
  • Processing unstructured data
  • Processing only unstructured data

Question 3

Question
Big Data addresses
Answer
  • combine multiple unrelated datasets
  • Another DW to enrich data

Question 4

Question
Big Data types of data
Answer
  • machine-generated
  • only human-generated
  • hidden data

Question 5

Question
Benefits of Big Data
Answer
  • Enterprise app
  • fault and fraud detection
  • slow processing

Question 6

Question
characteristics of data in Big Data
Answer
  • complex, variety, volume, veracity, value
  • variety, volume, velocity, veracity, value
  • incompatibilities, value, velocity, volume, veracity, many data

Question 7

Question
characteristics only of data in Big Data
Answer
  • value, variety, veracity
  • velocity, volume, value
  • velocity, variety, volume

Question 8

Question
veracity
Answer
  • structured data, unstructured data
  • signal, noise
  • social network

Question 9

Question
Human-generated data examples
Answer
  • microblogging
  • web log
  • sensor data

Question 10

Question
value
Answer
  • more value more veracity, more value more time
  • more value less veracity, more value less time
  • more value more veracity, more value less time

Question 11

Question
benefits
Answer
  • operational optimization, noise information
  • scientific discoveries, actionable intelligence
  • accurate predictions, shipped cloud computing

Question 12

Question
Dataset
Answer
  • collections or groups of related data that share different sets of attributes
  • collections or groups of related data that share the same set of attributes
  • discipline of gaining an understanding of data

Question 13

Question
Data Analysis
Answer
  • discipline of gaining an understanding of data
  • process of examining data to find facts, relationships, patterns
  • collections or groups of related data that share the same set of attributes

Question 14

Question
analytics
Answer
  • process of gaining an understanding of data
  • discipline of gaining an understanding of data by analyzing it via a multitude of techniques
  • collections or groups of related data

Question 15

Question
in business-oriented environments, analytics
Answer
  • results can lower operational costs and facilitate strategic decision-making
  • help identify the cause of a phenomenon to improve the accuracy of predictions
  • help strengthen the focus on delivering high quality services by driving down costs

Question 16

Question
in the scientific domain, analytics
Answer
  • help strengthen the focus on delivering high quality services by driving down costs
  • results can lower operational costs and facilitate strategic decision-making
  • help identify the cause of a phenomenon to improve the accuracy of predictions

Question 17

Question
in services-based environments, analytics
Answer
  • results can lower operational costs and facilitate strategic decision-making
  • help strengthen the focus on delivering high quality services by driving down costs
  • help identify the cause of a phenomenon to improve the accuracy of predictions

Question 18

Question
Business intelligence
Answer
  • process of gaining an understanding of data
  • process of gaining insights into the workings of an enterprise
  • discipline of gaining an understanding of data by analyzing it via a multitude of techniques

Question 19

Question
KPI (Key performance indicators)
Answer
  • utilize the consolidated data contained in data warehouses to run analytical queries
  • is a measure for gauging success within a particular context
  • discipline of gaining an understanding of data by analyzing it via a multitude of techniques

Question 20

Question
KPIs are used to
Answer
  • achieve regulatory compliance
  • help identify the cause of a phenomenon to improve the accuracy of predictions
  • help strengthen the focus on delivering high quality services by driving down costs

Question 21

Question
units of measure in Big Data
Answer
  • kilometer, megabyte, gigabyte, terabyte, petabyte, exabyte, zettabyte, yottabyte
  • kilobyte, megabyte, gigabyte, terabyte, petabyte, exabyte, zettabyte, yottabyte
  • kilobyte, megabyte, gigabyte, terabyte, petabyte, exabyte, zettabyte, youtube
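
For review, the ordering of the byte units can be sketched in code. This is only an illustration, assuming decimal steps (a factor of 1000 between consecutive units); the name `human_readable` is hypothetical and not part of the quiz.

```python
# Byte units in ascending order, each assumed to be 1000 times the previous one.
UNITS = ["byte", "kilobyte", "megabyte", "gigabyte", "terabyte",
         "petabyte", "exabyte", "zettabyte", "yottabyte"]


def human_readable(num_bytes: float) -> str:
    """Express a byte count in the largest unit that keeps the value at 1 or above."""
    value = float(num_bytes)
    index = 0
    # Step up one unit at a time until the value drops below 1000
    # or the largest known unit is reached.
    while value >= 1000 and index < len(UNITS) - 1:
        value /= 1000
        index += 1
    return f"{value:.1f} {UNITS[index]}"


print(human_readable(2_500_000_000_000))
```

Some sources use binary steps (a factor of 1024) instead; swapping the divisor is the only change needed under that assumption.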

Question 22

Question
big data emerged from a combination of business needs and technology innovations
Answer
  • true
  • false

Question 23

Question
analytics & data science
Answer
  • For many businesses, digital mediums have replaced physical mediums as the de facto
  • Based on open-source software that requires little more than commodity hardware
  • machine learning algorithms, statistical techniques and data warehousing

Question 24

Question
digitalization
Answer
  • Lead to an opportunity to collect further secondary data
  • Collecting and storing more data to potentially find new insights and gain a competitive edge
  • collecting and processing large quantities of diverse data has become increasingly affordable

Question 25

Question
affordable technology & commodity hardware
Answer
  • the maturity of these fields of practice inspired and enabled much of the core functionality
  • use of commodity hardware makes the adoption of Big Data solutions accessible to businesses
  • some examples include on-demand TV and streaming video

Question 26

Question
social media
Answer
  • Has empowered customers to provide feedback in near-realtime via open and public mediums
  • Has resulted in massive data streams
  • are capable of providing highly scalable, on-demand IT resources that can be leased

Question 27

Question
hyper-connected communities & devices
Answer
  • As a result, businesses are storing increasing amounts of data on customer interaction
  • leverage the infrastructure, storage and processing capabilities provided by these environments
  • the broadening coverage of the internet and the proliferation of cellular and Wi-Fi networks

Question 28

Question
cloud computing
Answer
  • Businesses are also increasingly interested in incorporating publicly available datasets
  • Is either directly through online interaction or indirectly through the usage of connected devices
  • IT resources that can be leased, which dramatically reduces the required up-front investment of Big Data projects

Question 29

Question
Online Transaction Processing (OLTP)
Answer
  • software system that processes transaction-oriented data
  • Is a system used for processing data analysis queries
  • process of loading data from a source system into a target system

Question 30

Question
batch-processed
Answer
  • OLTP
  • OLAP

Question 31

Question
data fully normalized
Answer
  • OLTP
  • OLAP

Question 32

Question
CRUD operations with sub-second response times
Answer
  • OLTP
  • OLAP

Question 33

Question
Online Analytical Processing (OLAP)
Answer
  • software system that processes transaction-oriented data
  • Store operational data that is fully normalized
  • Is a system used for processing data analysis queries

Question 34

Question
data mining and machine learning processes
Answer
  • OLTP
  • OLAP

Question 35

Question
OLAP
Answer
  • Representing a common source of structured analytics input
  • Can serve as both a data source and a data sink that is capable of receiving data
  • Big data analysis results can also be fed back

Question 36

Question
OLAP
Answer
  • Representing a common source of structured analytics input
  • are used in diagnostic, predictive and prescriptive analytics
  • Examples include ticket reservation systems, banking and POS transactions

Question 37

Question
data that is aggregated and denormalized
Answer
  • OLAP
  • OLTP

Question 38

Question
OLAP uses databases
Answer
  • that store historical data in multidimensional arrays and can answer complex queries
  • are comprised of simple insert, delete and update operations
  • that processes transaction-oriented data

Question 39

Question
An OLAP system is always
Answer
  • fed with data from multiple OLTP systems using regular batch processing jobs
  • data is fully normalized
  • comprised of simple CRUD operations

Question 40

Question
OLAP: denormalized data in the form of cubes
Answer
  • FALSE
  • TRUE

Question 41

Question
Extract-transform-load (ETL)
Answer
  • allows the data to be queried during any data analysis tasks that are performed later
  • Is a process of loading data from a source system into a target system
  • queries can take several minutes or even longer, depending on the complexity of query.
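
As a study aid, the three ETL stages can be sketched with the Python standard library. This is a minimal sketch under stated assumptions, not a production pipeline: the in-memory CSV text, the `sales` table and the cleansing rule (drop rows with a missing amount) are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical source system: a flat file, held in memory for the example.
SOURCE_CSV = "id,name,amount\n1, Ana ,10.5\n2,Luis,\n3,Eva,7\n"


def extract(text):
    """Extract: read the rows from the flat-file source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim names, convert types, drop rows with a missing amount."""
    cleaned = []
    for row in rows:
        if not row["amount"].strip():
            continue  # assumed cleansing rule: skip incomplete rows
        cleaned.append((int(row["id"]), row["name"].strip(), float(row["amount"])))
    return cleaned


def load(rows, conn):
    """Load: insert the cleansed rows into the target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (id INTEGER, name TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


# Hypothetical target system: an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
load(transform(extract(SOURCE_CSV)), conn)
total = conn.execute("SELECT SUM(amount) FROM sales").fetchone()[0]
print(total)
```

Once loaded, the data can be queried by later analysis tasks, which is the role the target system plays in the definitions above.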

Question 42

Question
Extract-transform-load (ETL) source
Answer
  • database, flat file or an application
  • on-demand TV and streaming video
  • digitalization and social media

Question 43

Question
ETL
Answer
  • Represents the main operation through which data warehouses are fed data
  • Represents the main operation through which datasets are fed data
  • Represents the main operation through which databases are fed data

Question 44

Question
ETL
Answer
  • Extract load transform
  • Extract transform load
  • Extract transform leave

Question 45

Question
ETL data types
Answer
  • Unstructured data, structured data and semi-structured data
  • Only structured data and semi-structured data
  • Only unstructured data

Question 46

Question
Does a data warehouse (EDWH) contain historical data?
Answer
  • FALSE
  • TRUE

Question 47

Question
Data warehouses (EDWH)
Answer
  • is a subset of the data, that typically belongs to a department
  • is an open-source framework
  • usually interface with an OLAP system to support analytical queries

Question 48

Question
EDWH
Answer
  • this allows the data to be queried during any data analysis
  • Heavily used by BI to run various analytical queries
  • software system that processes transaction-oriented data

Question 49

Question
EDWH sources
Answer
  • social media, facebook twitter
  • OLTP, ERP, CRM and SCM systems
  • OLAP, ERP, CRM and SCM systems

Question 50

Question
EDWH
Answer
  • As the amount of data it contains continues to increase, BI analysis can suffer.
  • Is a process of loading data from a source system into a target system
  • software system that processes transaction-oriented data

Question 51

Question
EDWH
Answer
  • Has established itself as a de facto industry platform for contemporary Big Data solutions
  • usually contain optimized databases, called analytical databases to handle reporting and data analysis
  • Represents the main operation through which databases are fed data

Question 52

Question
EDWH: an analytical database can't exist as a separate DBMS
Answer
  • TRUE
  • FALSE

Question 53

Question
Data mart
Answer
  • is a subset of the data, that typically belongs to a department
  • can have multiple EDWH
  • based on cleansed data, which is a prerequisite for accurate and error-free reports

Question 54

Question
Hadoop is an open-source framework for
Answer
  • large data storage
  • data processing
  • diagnostic, predictive and prescriptive
  • run on commodity hardware
  • denormalized data in the form of cubes

Question 55

Question
Hadoop
Answer
  • has established itself as a de facto industry platform for contemporary Big Data solutions
  • is a central, enterprise-wide repository
  • is always fed with data from multiple OLTP systems using regular batch processing jobs

Question 56

Question
Hadoop can be used as an engine for
Answer
  • ETL
  • analytics
  • OLTP
  • EDWH

Question 57

Question
Hadoop can process large amounts of structured, semi-structured and unstructured data
Answer
  • false
  • true

Question 58

Question
volume refers to
Answer
  • insert data
  • process data
  • velocity processing

Question 59

Question
Data volumes can include
Answer
  • Online transaction
  • sensor data
  • batch
  • social media
  • OLAP
  • scientific and research data

Question 60

Question
velocity
Answer
  • multiple types of data that need to be supported by Big Data solutions
  • data translates into the amount of time it takes for the data to be processed
  • data that is processed by Big Data solutions is substantial and usually ever-growing

Question 61

Question
Depending on the data source, velocity may not always be high
Answer
  • false
  • true

Question 62

Question
variety
Answer
  • Quality or fidelity of data
  • usefulness of data for an enterprise
  • refers to the multiple formats and types of data that need to be supported by Big Data Solutions

Question 63

Question
veracity
Answer
  • The appropriate form of data storage
  • Refers to the quality or fidelity of data
  • Refers to the usefulness of data for an enterprise

Question 64

Question
Noise and signal refer to
Answer
  • volume
  • value
  • veracity

Question 65

Question
value
Answer
  • refers to quality or fidelity of data
  • refers to the multiple formats and types of data that need to be supported
  • refers to usefulness of data for an enterprise

Question 66

Question
value is directly related to veracity in that the higher the data fidelity, the more value it holds for the business.
Answer
  • true
  • false

Question 67

Question
type of data
Answer
  • structured data, unstructured data, semi-structured data
  • structured data, unstructured data, semi-structured data, metadata

Question 68

Question
ERP and CRM are examples of
Answer
  • unstructured data
  • structured data
  • semi-structured data

Question 69

Question
image, audio and video files are examples of
Answer
  • semi-structured data
  • unstructured data
  • structured data

Question 70

Question
Unstructured data generally makes up 80%
Answer
  • true
  • false

Question 71

Question
unstructured data does generally require special or customized logic when it comes to pre-processing and storage
Answer
  • true
  • false

Question 72

Question
semi-structured data
Answer
  • has a defined level of structure and consistency, and can be relational in nature
  • has a defined level of structure and consistency, but cannot be relational in nature
  • cannot be inherently processed or queried using SQL or traditional programming features

Question 73

Question
semi-structured data
Answer
  • CRM or ERP
  • XML, electronic data interchanges, e-mails, spreadsheets, RSS feeds and sensor data
  • image or audio files

Question 74

Question
metadata
Answer
  • provides information about the analysis
  • provides information about a dataset's characteristics and structure

Question 75

Question
metadata is generally machine-generated and automatically appended to the data
Answer
  • true
  • false

Question 76

Question
metadata xml tag
Answer
  • provides the author and creation date of a document, or the file size and resolution of a digital photograph
  • audio binary file
  • video binary file

Question 77

Question
semi-structured data and unstructured data have a greater noise-to-signal ratio than structured data
Answer
  • true
  • false

Question 78

Question
ETL can perform data cleansing and verification
Answer
  • false
  • true

Question 79

Question
data analysis
Answer
  • quantitative analysis
  • scientific analysis
  • qualitative analysis
  • data mining

Question 80

Question
quantitative analysis
Answer
  • quantifying the patterns and correlations found in the data
  • phenomenon in the data
  • outlier

Question 81

Question
qualitative analysis uses
Answer
  • numbers
  • words
  • graphical

Question 82

Question
involves analyzing a smaller sample in greater depth
Answer
  • quantitative analysis
  • qualitative analysis

Question 83

Question
analysis that targets large datasets
Answer
  • quantitative analysis
  • data mining
  • qualitative analysis

Question 84

Question
data mining (data discovery)
Answer
  • descriptions
  • patterns and correlations
  • identify patterns and trends

Question 85

Question
data mining forms the basis for predictive analytics and business intelligence (BI)
Answer
  • false
  • true

Question 86

Question
analysis tools can automate data analyses
Answer
  • false
  • true

Question 87

Question
descriptive, diagnostic, predictive, prescriptive
Answer
  • types of analytics
  • types of analysis
  • diagnostic

Question 88

Question
questions about events that have already occurred
Answer
  • diagnostic analytics
  • descriptive analytics
  • predictive analytics

Question 89

Question
reporting or dashboards. The reports are generally static, and queries are executed in OLTP systems such as CRM and ERP
Answer
  • descriptive analytics
  • diagnostic analytics
  • prescriptive analytics

Question 90

Question
determine the causes of a phenomenon that occurred in the past
Answer
  • descriptive analytics
  • diagnostic analytics
  • predictive analytics

Question 91

Question
interactive visualization to identify trends and patterns, and queries are executed in OLAP systems
Answer
  • descriptive analytics
  • diagnostic analytics
  • prescriptive analytics

Question 92

Question
attempts to determine the outcome of an event that might occur in the future
Answer
  • prescriptive analytics
  • predictive analytics
  • descriptive analytics

Question 93

Question
The focus is on which prescribed option to follow, and why and when it should be followed, to gain an advantage or mitigate risk
Answer
  • descriptive analytics
  • prescriptive analytics
  • predictive analytics

Question 94

Question
incorporates internal data (historical, etc.) and external data (social media, demographic data)
Answer
  • descriptive analytics
  • prescriptive analytics
  • diagnostic analytics

Question 95

Question
machine learning
Answer
  • is the process of teaching computers to learn from existing data and apply the acquired knowledge to formulate predictions about unknown data.
  • is the discipline of teaching computers to learn from existing data and apply the acquired knowledge to formulate predictions about unknown data.
  • is the framework for teaching computers to learn from existing data and apply the acquired knowledge to formulate predictions about unknown data.

Question 96

Question
based on the input data and categories
Answer
  • supervised learning
  • unsupervised learning

Question 97

Question
the algorithm attempts to categorize data by grouping data with similar attributes together
Answer
  • supervised learning
  • unsupervised learning
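
The idea of grouping data with similar attributes, without any category labels given in advance, can be sketched as follows. The quiz does not name a specific algorithm; a simple one-dimensional k-means-style loop is used here only as one common example, and the `ages` values and starting centers are hypothetical.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group numbers around the nearest center, then move each center to its group's mean."""
    groups = []
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for v in values:
            # Assign the value to the closest center (no labels are used).
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Recompute each center; keep the old one if its group is empty.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return groups, centers


ages = [18, 21, 19, 45, 50, 47]  # unlabeled input data (hypothetical)
groups, centers = kmeans_1d(ages, centers=[20.0, 40.0])
print(groups)
```

In a supervised setting the same data would arrive with known categories attached, and the algorithm would learn to reproduce them instead of discovering the groups itself.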

Question 98

Question
machine learning makes predictions and identifies hidden patterns
Answer
  • false
  • true

Question 99

Question
machine learning can use the output from data mining for further data classification.
Answer
  • true
  • false

Question 100

Question
traditional BI utilizes
Answer
  • descriptive and diagnostic
  • diagnostic and predictive
  • descriptive and prescriptive

Question 101

Question
ad-hoc reports and dashboards
Answer
  • traditional Big Data
  • traditional BI

Question 102

Question
ad hoc reporting
Answer
  • the focus is usually on a specific area of the business
  • the focus is a view of key business areas in real time or near-real time

Question 103

Question
big data
Answer
  • facilitate the development of an enterprise-wide understanding of the way a business works
  • focus on individual business processes
  • descriptive and diagnostic to facilitate the development of an enterprise-wide understanding of the way a business works

Question 104

Question
in data visualization, analytical results are graphically communicated using elements like
Answer
  • charts, maps, data grids, infographics and alerts
  • ad-hoc reports, drill-down
  • dashboards

Question 105

Question
data visualization tools in Big Data use
Answer
  • in-disk analytical technologies
  • in-memory analytical technologies

Question 106

Question
aggregation
Answer
  • groups data across multiple categories to show subtotals and totals
  • global and summarized view of data across multiple contexts
  • enables a detail view of the data of interest by focusing in on a data subset

Question 107

Question
visualization features: drill down
Answer
  • enables a detail view of the data of interest by focusing in on a data subset
  • global and summarized view of data across multiple contexts
  • groups data across multiple categories to show subtotals and totals

Question 108

Question
visualization features: roll up
Answer
  • global and summarized view of data across multiple contexts
  • groups data across multiple categories to show subtotals and totals
  • enables a detail view of the data of interest by focusing in on a data subset
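
One possible reading of the three visualization features from the last few questions, sketched in plain Python. The sales records and function names are hypothetical, and real visualization tools expose these operations interactively and not as function calls.

```python
from collections import defaultdict

# Hypothetical sales records: (region, product, amount).
SALES = [
    ("North", "laptop", 1200),
    ("North", "phone", 800),
    ("South", "laptop", 900),
    ("South", "phone", 600),
]


def aggregate(records):
    """Aggregation: one summarized view over all records."""
    amounts = [amount for _region, _product, amount in records]
    return {"count": len(amounts), "total": sum(amounts),
            "average": sum(amounts) / len(amounts)}


def roll_up(records):
    """Roll-up: group by category (here, region) to show subtotals and a total."""
    subtotals = defaultdict(int)
    for region, _product, amount in records:
        subtotals[region] += amount
    return dict(subtotals), sum(subtotals.values())


def drill_down(records, region):
    """Drill-down: focus on the detail rows of one data subset."""
    return [r for r in records if r[0] == region]


print(aggregate(SALES))
print(roll_up(SALES))
print(drill_down(SALES, "South"))
```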

Question 109

Question
data visualization tools for Big data solutions incorporate
Answer
  • diagnostic and descriptive
  • predictive and descriptive
  • predictive and prescriptive

Question 110

Question
advanced visualizations require an ETL
Answer
  • true
  • false