CS614 Quiz No. 2 Solution and Discussion

Khuram Shahzad

Quiz Start Time: 04:09 PM Time Left 49
sec(s)

Question # 1 of 10 ( Start time: 04:09:48 PM ) Total Marks: 1
In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’ rows in inner table, time complexity is
Select correct option:

O(MN) (Correct)

Quiz Start Time: 04:09 PM Time Left 35
sec(s)

Question # 2 of 10 ( Start time: 04:11:20 PM ) Total Marks: 1
In context of data mining definition, the term “value” means:
Select correct option:

importance of hidden perameters discovered (Correct)

Quiz Start Time: 04:09 PM Time Left 56
sec(s)

Question # 3 of 10 ( Start time: 04:12:51 PM ) Total Marks: 1
Classification consists of examining the properties of a newly presented observation and assigning it to a predefined ____________.
Select correct option:

Class (Correct)

Quiz Start Time: 04:09 PM Time Left 25
sec(s)

Question # 4 of 10 ( Start time: 04:13:44 PM ) Total Marks: 1
The optimizer uses a hash join to join two tables if they are joined using an equijoin and
Select correct option:

Large amount of data need to be joined (Correct)
A large amount of data needs to be joined.
A large portion of the table needs to be joined.

Quiz Start Time: 04:09 PM Time Left 20
sec(s)

Question # 5 of 10 ( Start time: 04:15:07 PM ) Total Marks: 1
In context of nested-loop join, actual number of matching rows returned as a result of the join would be ______ of the order of tables
Select correct option:

Independent (Correct)

Quiz Start Time: 04:09 PM Time Left 13
sec(s)

Question # 6 of 10 ( Start time: 04:16:36 PM ) Total Marks: 1
Normally the input data structure (a database table) for a data mining algorithm:
Select correct option:

Quiz Start Time: 04:09 PM Time Left 12
sec(s)

Question # 7 of 10 ( Start time: 04:18:07 PM ) Total Marks: 1
Mining multi dimensional databases allow users to:
Select correct option:

Analyze Data (Correct)

Quiz Start Time: 04:09 PM Time Left 24
sec(s)

Question # 8 of 10 ( Start time: 04:19:38 PM ) Total Marks: 1
________ refers to the overall process of discovering useful knowledge from data and data mining refers to a particular step in this process.
Select correct option:

Knowledge discovery in database (Correct)

Quiz Start Time: 04:09 PM Time Left 17
sec(s)

Question # 9 of 10 ( Start time: 04:21:01 PM ) Total Marks: 1
In case of nested-loop join, Inner table is accessed _____ for each qualifying row (or touple) in outer table
Select correct option:

One Time (Correct)

Quiz Start Time: 04:09 PM Time Left 10
sec(s)

Question # 10 of 10 ( Start time: 04:22:32 PM ) Total Marks: 1
In contrast to data mining, statistics is ______ driven.
knowledge (Correct)

Quiz Start Time: 08:50 PM Time Left 55
sec(s)

Question # 1 of 10 ( Start time: 08:50:34 PM ) Total Marks: 1
In context of data mining definition, the term “value” means:
Select correct option:

The primary key of table
The index location of the record
Importance of hidden patterns discovered (Answer)
Numerical or string measure assigned to an attribute

Quiz Start Time: 08:50 PM Time Left 22
sec(s)

Question # 2 of 10 ( Start time: 08:51:23 PM ) Total Marks: 1
Data mining is all about:
Select correct option:

Knowledge discovery in database
Finding hidden patterns in data
Finding relationships in data
All of the given options ( may be Answer)

Quiz Start Time: 08:50 PM Time Left 19
sec(s)

Question # 3 of 10 ( Start time: 08:52:41 PM ) Total Marks: 1
In contrast to statistics, data mining is ______ driven.
Select correct option:

Assumption (Answer)
Knowledge
Human
Database

Quiz Start Time: 08:50 PM Time Left 55
sec(s)

Question # 4 of 10 ( Start time: 08:53:56 PM ) Total Marks: 1
Mining multi dimensional databases allow users to:
Select correct option:

Categorize the data
Analyze the data (Answer)
Summarize the data
All of the given options

Quiz Start Time: 08:50 PM Time Left 18
sec(s)

Question # 5 of 10 ( Start time: 08:54:40 PM ) Total Marks: 1
In context of data mining definition, the term “nontrivial” means:
Select correct option:

Discovering information is a simple task
Discovering information is a complex task
We can not discover information
We simply find things rather than discovery (Answer)

Quiz Start Time: 08:50 PM Time Left 1
sec(s)

Question # 6 of 10 ( Start time: 08:55:56 PM ) Total Marks: 1
Identify the TRUE statement:
Select correct option:

Clustering is unsupervised learning and classification is supervised learning
Clustering is supervised learning and classification is unsupervised learning
Both clustering and classification are unsupervised learning
Both clustering and classification are supervised learning

Quiz Start Time: 08:50 PM Time Left 27
sec(s)

Question # 7 of 10 ( Start time: 08:57:26 PM ) Total Marks: 1
In ________learning you don’t know the number of clusters and no idea about their attributes.
Select correct option:

Supervised learning
Unsupervised learning (Answer)
Multi Dimension modeling
None of the given options

Quiz Start Time: 08:50 PM Time Left 50
sec(s)

Question # 8 of 10 ( Start time: 08:58:39 PM ) Total Marks: 1
In context of clustering, the term “distance” means:
Select correct option:

Similarity/dissimilarly of records (Answer)
The difference between the primary keys of two records
The relation of a record with corresponding record in child table
None of the given options

Quiz Start Time: 08:50 PM Time Left 15
sec(s)

Question # 9 of 10 ( Start time: 08:59:22 PM ) Total Marks: 1
In data mining, initially you _____ what you are looking for.
Select correct option:

Know
Don’t know (Answer)
May or may not know
None of the given options

Quiz Start Time: 08:50 PM Time Left 52
sec(s)

Question # 10 of 10 ( Start time: 09:00:41 PM ) Total Marks: 1
The optimizer uses a hash join to join two tables if they are joined using an equijoin and
Select correct option:

Outer table has less number of rows
Inner table has less number of rows
Cardinality of tables is equal
Large amount of data needs to be joined (Answer)

zareen

In context of the most fundamental data warehouse life cycle model, which of the following is NOT one of the data warehouse design activities?
Select correct option:
End-user interviews and re-interviews
Source system cataloguing
Definition of key performance indicators
System vision development

zareen

Vertically wide data means:

zareen

In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that:

zareen

Data mining is all about:

zareen

“If resources increase in proportion to increase in data size, time is constant”. The statement refers to:

zareen

An effective user education program includes, among others, the following guideline(s):

zareen

Implementation of a data warehouse requires ________ activities

Highly integrated

Loosely integrated

Tightly decoupled

None of the given

zareen

Which of the following is NOT one of the three parallel tracks in Kimballs approach? CS614

zareen

Which of the following is NOT one of the methodologies for Data Warehouse project development?

System Driven

zareen

In contrast to data mining, statistics is ______ driven. CS614

zareen

Quiz Start Time: 04:09 PM Time Left 49
sec(s)

Question # 1 of 10 ( Start time: 04:09:48 PM ) Total Marks: 1
In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’ rows in inner table, time complexity is
Select correct option:

O(MN) (Correct)

Quiz Start Time: 04:09 PM Time Left 35
sec(s)

Question # 2 of 10 ( Start time: 04:11:20 PM ) Total Marks: 1
In context of data mining definition, the term “value” means:
Select correct option:

importance of hidden perameters discovered (Correct)

Quiz Start Time: 04:09 PM Time Left 56
sec(s)

Question # 3 of 10 ( Start time: 04:12:51 PM ) Total Marks: 1
Classification consists of examining the properties of a newly presented observation and assigning it to a predefined ____________.
Select correct option:

Class (Correct)

Quiz Start Time: 04:09 PM Time Left 25
sec(s)

Question # 4 of 10 ( Start time: 04:13:44 PM ) Total Marks: 1
The optimizer uses a hash join to join two tables if they are joined using an equijoin and
Select correct option:

Large amount of data need to be joined (Correct)
A large amount of data needs to be joined.
A large portion of the table needs to be joined.

Quiz Start Time: 04:09 PM Time Left 20
sec(s)

Question # 5 of 10 ( Start time: 04:15:07 PM ) Total Marks: 1
In context of nested-loop join, actual number of matching rows returned as a result of the join would be ______ of the order of tables
Select correct option:

Independent (Correct)

Quiz Start Time: 04:09 PM Time Left 13
sec(s)

Question # 6 of 10 ( Start time: 04:16:36 PM ) Total Marks: 1
Normally the input data structure (a database table) for a data mining algorithm:
Select correct option:

Quiz Start Time: 04:09 PM Time Left 12
sec(s)

Question # 7 of 10 ( Start time: 04:18:07 PM ) Total Marks: 1
Mining multi dimensional databases allow users to:
Select correct option:

Analyze Data (Correct)

Quiz Start Time: 04:09 PM Time Left 24
sec(s)

Question # 8 of 10 ( Start time: 04:19:38 PM ) Total Marks: 1
________ refers to the overall process of discovering useful knowledge from data and data mining refers to a particular step in this process.
Select correct option:

Knowledge discovery in database (Correct)

Quiz Start Time: 04:09 PM Time Left 17
sec(s)

Question # 9 of 10 ( Start time: 04:21:01 PM ) Total Marks: 1
In case of nested-loop join, Inner table is accessed _____ for each qualifying row (or touple) in outer table
Select correct option:

One Time (Correct)

Quiz Start Time: 04:09 PM Time Left 10
sec(s)

Question # 10 of 10 ( Start time: 04:22:32 PM ) Total Marks: 1
In contrast to data mining, statistics is ______ driven.
knowledge (Correct)

Quiz Start Time: 08:50 PM Time Left 55
sec(s)

Question # 1 of 10 ( Start time: 08:50:34 PM ) Total Marks: 1
In context of data mining definition, the term “value” means:
Select correct option:

The primary key of table
The index location of the record
Importance of hidden patterns discovered (Answer)
Numerical or string measure assigned to an attribute

Quiz Start Time: 08:50 PM Time Left 22
sec(s)

Question # 2 of 10 ( Start time: 08:51:23 PM ) Total Marks: 1
Data mining is all about:
Select correct option:

Knowledge discovery in database
Finding hidden patterns in data
Finding relationships in data
All of the given options ( may be Answer)

Quiz Start Time: 08:50 PM Time Left 19
sec(s)

Question # 3 of 10 ( Start time: 08:52:41 PM ) Total Marks: 1
In contrast to statistics, data mining is ______ driven.
Select correct option:

Assumption (Answer)
Knowledge
Human
Database

Quiz Start Time: 08:50 PM Time Left 55
sec(s)

Question # 4 of 10 ( Start time: 08:53:56 PM ) Total Marks: 1
Mining multi dimensional databases allow users to:
Select correct option:

Categorize the data
Analyze the data (Answer)
Summarize the data
All of the given options

Quiz Start Time: 08:50 PM Time Left 18
sec(s)

Question # 5 of 10 ( Start time: 08:54:40 PM ) Total Marks: 1
In context of data mining definition, the term “nontrivial” means:
Select correct option:

Discovering information is a simple task
Discovering information is a complex task
We can not discover information
We simply find things rather than discovery (Answer)

Quiz Start Time: 08:50 PM Time Left 1
sec(s)

Question # 6 of 10 ( Start time: 08:55:56 PM ) Total Marks: 1
Identify the TRUE statement:
Select correct option:

Clustering is unsupervised learning and classification is supervised learning
Clustering is supervised learning and classification is unsupervised learning
Both clustering and classification are unsupervised learning
Both clustering and classification are supervised learning

Quiz Start Time: 08:50 PM Time Left 27
sec(s)

Question # 7 of 10 ( Start time: 08:57:26 PM ) Total Marks: 1
In ________learning you don’t know the number of clusters and no idea about their attributes.
Select correct option:

Supervised learning
Unsupervised learning (Answer)
Multi Dimension modeling
None of the given options

Quiz Start Time: 08:50 PM Time Left 50
sec(s)

Question # 8 of 10 ( Start time: 08:58:39 PM ) Total Marks: 1
In context of clustering, the term “distance” means:
Select correct option:

Similarity/dissimilarly of records (Answer)
The difference between the primary keys of two records
The relation of a record with corresponding record in child table
None of the given options

Quiz Start Time: 08:50 PM Time Left 15
sec(s)

Question # 9 of 10 ( Start time: 08:59:22 PM ) Total Marks: 1
In data mining, initially you _____ what you are looking for.
Select correct option:

Know
Don’t know (Answer)
May or may not know
None of the given options

Quiz Start Time: 08:50 PM Time Left 52
sec(s)

Question # 10 of 10 ( Start time: 09:00:41 PM ) Total Marks: 1
The optimizer uses a hash join to join two tables if they are joined using an equijoin and
Select correct option:

Outer table has less number of rows
Inner table has less number of rows
Cardinality of tables is equal
Large amount of data needs to be joined (Answer)

zareen

1_ As per Bill Inmost, a data warehouse, in contrast with classical applications is:

Data driven _ pg285

2_ Which of the following is NOT one of the three parallel tracks in Kimball’s approach?
Lifecycle Maintenance track

3_ Bill Inmon argues that requirements are well understood only after
Data warehouse is populated _ pg285

4_ Goal driven approach of data warehouse development was result of ______ work
Böhnlein and Ulbrich-vom _ pg285

5_ Identify the TRUE statement:
Clustering is unsupervised learning and classification is supervised learning _ pg 270

6_ Normally the term “DWH face to the business user” refers to:
Lifecycle Analytical Applications track _ pg 306

7_ In ________learning you don’t know the number of clusters and no idea about their attributes.
Unsupervised learning
https://www.cs.uic.edu/~liub/teach/cs583-fall-05/CS583-unsupervised-learning.ppt

8_ Waterfall model is appropriate when
Requirements are clearly defined _ pg 284

9_ Implementation of a data warehouse requires ________ activities.
none of above

10_ Normally the input data structure (a database table) for a data mining algorithm:
Has more number of records than attributes (not sure)

cyberian

@zareen said in CS614 Quiz No. 2 Solution and Discussion:

Q10: Which of the following is NOT one of the variants of Nested-loop join?
Binary index nested-loop join.

The following are variants of the nested-loop join:

Basic Nested-Loop Join: A straightforward nested-loop join where for each tuple in the outer relation, the inner relation is scanned entirely.
Block Nested-Loop Join: Instead of processing one tuple at a time, this method processes a block of tuples from the outer relation, which can reduce the number of disk I/O operations.
Index Nested-Loop Join: This variant uses an index on the inner relation to quickly find matching tuples, which can significantly speed up the join operation.

NOT a variant:

Hash Join: This is not a variant of the nested-loop join. Instead, it’s a different join algorithm that uses a hash table to partition one or both of the relations before performing the join.

So, Hash Join would be the correct answer if you’re asked which one is not a variant of the nested-loop join.

cyberian

@zareen said in CS614 Quiz No. 2 Solution and Discussion:

1_ As per Bill Inmost, a data warehouse, in contrast with classical applications is:

Data driven _ pg285

According to Bill Inmon, a prominent figure in the field of data warehousing, a data warehouse is distinguished from classical applications by several key characteristics:

Subject-Oriented: Data warehouses are designed around key subjects or business areas (such as sales, finance, or customer information), rather than focusing on individual applications or processes.
Integrated: Data in a data warehouse is collected from multiple sources and is integrated into a consistent format. This integration allows for unified reporting and analysis across the organization.
Time-Variant: Data warehouses store historical data, allowing users to analyze trends and changes over time. This time dimension is crucial for tracking business performance and making long-term decisions.
Non-Volatile: Once data is entered into a data warehouse, it is not typically updated or deleted. This ensures that the data remains stable and can be used for historical analysis without being altered.
Optimized for Querying and Reporting: Unlike classical applications, which are optimized for transaction processing, data warehouses are optimized for complex queries and analysis. This optimization allows for efficient reporting and decision-making.

In summary, according to Bill Inmon, a data warehouse is subject-oriented, integrated, time-variant, non-volatile, and optimized for querying and reporting, distinguishing it from classical applications that are more focused on transaction processing and operational tasks.

cyberian

@zareen said in CS614 Quiz No. 2 Solution and Discussion:

4_ Goal driven approach of data warehouse development was result of ______ work
Böhnlein and Ulbrich-vom _ pg285

The goal-driven approach to data warehouse development resulted from Bill Inmon’s work.

Bill Inmon, often referred to as the “father of data warehousing,” advocated for a methodical, top-down approach to data warehouse development. This approach focuses on defining business goals and requirements first and then designing the data warehouse to meet these needs. It emphasizes the importance of a clear understanding of business objectives and data requirements before designing and implementing the data warehouse.

In contrast, Ralph Kimball’s approach is known for its bottom-up methodology, which focuses on building data marts first and then integrating them into a comprehensive data warehouse. Both approaches have their own merits, but Inmon’s goal-driven approach is specifically recognized for its emphasis on aligning data warehouse development with business goals and objectives.

CS614 Quiz No. 2 Solution and Discussion

CS614 Assignment 3 Solution and Discussion

CS614 Assignment No.2 Solution and Discussion

CS614 Assignment 2 Solution and Discussion Spring 2020

CS614 Assignment 1 Solution and Discussion

CS614 Assignment 3 Solution and Discussion

CS614 GDB 1 Solution and Discussion

CS614 Assignment 2 Solution and Discussion

CS614 Assignment 1 Solution and Discussion

CS614 Assignment No.3 Solution and Discussion

CS614 GDB Solution and Discussion

How aggregates awareness helps the users?

Normalization effects performance?

CS614 Mid Term Past Paper and Please share your current Paper

CS614 Quiz No. 1 Solution and Discussion

CS614 Assignment No.2 Solution and Discussion

Assignment No. 1  Semester: Spring 2019 CS614 – Data Warehousing

CS614 Quiz No. 2 Solution and Discussion

Reputation Earning

Ads

File Sharing

Stats

0

3.1k

2.8k

8.6k

Popular Tags

Trending

Online User