Knowledge Check

Question

Which of these is the last step of an iteration within the CRISP-DM process?

Answer 1

Deployment

Answer 2

Evaluation

Answer 3

Mean absolute error

Answer 4

Data Ingestion

Answer 5

Preparing Data

Answer 6

Build and Train Models

Answer 7

Model Deployment

Answer 8

Monitoring Models

Answer 9

The model is too sensitive and a higher threshold would solve the problem.

Answer 10

The model is too specific and a lower threshold would solve the problem.

Answer 11

The model is too specific and a higher threshold would solve the problem.

Answer 12

The model is too sensitive and a lower threshold would solve the problem.

Answer 13

Publicly Identifiable Information

Answer 14

Personally Identifiable Information

Answer 15

Personally Identifiable Index

Answer 16

Publically Incriminating Index

Answer 17

Correlation coefficient

Answer 18

BLEU score

Answer 19

R-squared value

Answer 20

Confusion matrix

Answer 21

Replicate the dataset and run the predictions in production with that.

Answer 22

Train the model with different data because the model must make inferences using the same type of input data it saw during training.

Answer 23

Replicate the dataset and train the model with that.

Answer 24

Do not use the dataset when running the model in production.

Answer 25

Whether the data ingestion processes are event-driven and real time, or nightly batch.

Answer 26

Whether the data is being put into a data lake before analysis

Answer 27

Whether processes str in place to clean and preprocess data before storage

Answer 28

Whether business rules are applied on data in transit or in-situ

Answer 29

sklearn.metrics.r2_score

Answer 30

sklearn.metrics.mean_absolute_error

Answer 31

sklearn.metrics.auc

Answer 32

sklearn.metrics.median_absolute_error

Answer 33

Poisson distribution

Answer 34

Normal distribution

Answer 35

Uniform distribution

Answer 36

Logarithmic distribution

Answer 37

Systematic sampling

Answer 38

Stratified sampling

Answer 39

Convenience sampling

Answer 40

Simple random sampling

Answer 41

Female, Male

Answer 42

Female_No, Male_Yes

Answer 43

Female_Yes, Male_No

Answer 44

Female_Yes, Female_No, Male_Yes, Male_No

Answer 45

t-Distributed Stochastic Neighbor Embedding

Answer 46

Linear Discriminant Analysis

Answer 47

K-means model stacking

Answer 48

Principal Component Analysis

Answer 49

A proprietary platform built by Google and Docker to run and manage your applications.

Answer 50

A container orchestrator to provision, manage, and scale applications.

Answer 51

A serverless platform to build and manage your apps.

Answer 52

An an open-source system to deploy, manage, and run Cloud Foundry apps.

Answer 53

Build the environment in Apache Hive for HDInsight and use Azure Data Factory for orchestration.

Answer 54

Build the environment in Azure Databricks and use Azure Data Factory for orchestration.

Answer 55

Build the environment in Apache Spark for HDInsight and use Azure Container Instances for orchestration.

Answer 56

Build the environment in Azure Databricks and use Azure Container Instances for orchestration.

Next up

Knowledge Check

Description

Resource summary

Question 1

Question 2

Question 3

Question 4

Question 5

Question 6

Question 7

Question 8

Question 9

Question 10

Question 11

Question 12

Question 13

Question 14

Question 15

0 comments

Similar

	Created by Mohammed Arif Mazumder about 5 years ago