CS614 Quiz No. 2 Solution and Discussion
zaasmiZ
Please share your current Quiz. 2 to help the students…
CS614 - Data Warehousing
CS614 Assignment 3 Solution and Discussion
zareenZ
Re: CS614 Assignment 3 Solution and Discussion
Assignment No. 3
Semester: Spring 2020
CS614 – Data Warehousing
Total Marks: 10
Due Date: July 27, 2020

Objectives: After completing this assignment, the students will be able to:
• compare Parallel Processing and Serial Processing
• describe what and when to Parallelize
• calculate Speed up using Amdahl’s Law

Instructions
Please read the following instructions carefully before submitting the assignment. It should be clear that your assignment will not get any credit if:
• Assignment is submitted after the due date.
• Submitted assignment does not open or the file is corrupt.
• Assignment is copied (from the internet or from other students).
• Assignment is submitted in a format other than Word (.doc, .docx).

Assignment Scenario:
XYZ is a fabric manufacturing company. This company is planning to implement a DWH for its existing OLTP system implemented in 300+ stores all over Pakistan. The size of the company’s database is approx. 298 GB and grows at a rate of approximately 0.8 GB per day. The company has designed a state-of-the-art DWH in its head office in terms of hardware and software. The hardware consists of 7 systems having quad-core processors. In an ideal situation, a complex query would return results in 27 minutes on a single processor (serial execution).

After analyzing the scenario given above, you are required to answer the following questions:
Question # 1 – If a complex query is executed in parallel on 6 single-core processors, what would be the quantified speed-up time?
Question # 2 – Calculate the speed-up ratio using Amdahl’s Law, if 80% of query processing is done through parallel execution on the DWH hardware.

Deadline: Your assignment must be uploaded on VULMS on or before July 27, 2020. July 28, 2020 will be a bonus day for assignment submission. After the bonus day, no assignment will be entertained via email.
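For discussion, here is a minimal sketch of how the two questions above could be worked out. It is not an official solution. It assumes ideal linear speed-up for Question 1, and for Question 2 it assumes the "DWH hardware" means all 7 quad-core systems, i.e. 28 cores in total. The function name `amdahl_speedup` and the variable names are made up for this illustration.

```python
def amdahl_speedup(parallel_fraction: float, processors: int) -> float:
    """Amdahl's Law: speed-up = 1 / ((1 - f) + f / N)."""
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / processors)


serial_minutes = 27  # serial execution time from the scenario

# Question 1 (assumption: perfectly parallel query, so time scales as 1/N).
ideal_minutes_on_6 = serial_minutes / 6

# Question 2 (assumption: 7 systems x 4 cores = 28 processors, 80% parallel).
total_cores = 7 * 4
speedup_on_dwh = amdahl_speedup(0.8, total_cores)

print(f"Ideal time on 6 processors: {ideal_minutes_on_6} minutes")
print(f"Amdahl speed-up on {total_cores} cores: {speedup_on_dwh:.3f}")
```

If your reading of the scenario uses a different processor count, only `total_cores` needs to change.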
CS614 - Data Warehousing
CS614 Assignment No.2 Solution and Discussion
zareenZ
Re: CS614 Assignment No.2 Solution and Discussion
Assignment No. 2
Semester: Spring 2020
CS614 – Data Warehousing
Total Marks: 15
Due Date: June 17, 2020

Objectives: After completing this assignment, the students will be able to:
• De-normalize the given table using the horizontal splitting technique
• Calculate the total space used with normalization
• Calculate the total space used after de-normalization

Instructions
Please read the following instructions carefully before submitting the assignment. It should be clear that your assignment will not get any credit if:
• Assignment is submitted after the due date.
• Submitted assignment does not open or the file is corrupt.
• Assignment is copied (from the internet or from other students).
• Assignment is submitted in a format other than Word (.doc, .docx).

Assignment

Question No. 1
Consider the following table having the information of students of a university:

Student ID | Student Name | Campus ID | Student Age | Degree Program
1 | Ali | VLHR01 | 27 | MS
2 | Kamran | VISB01 | 24 | BS
3 | Akmal | VRWP01 | 24 | BS
4 | Ahmad | VLHR01 | 26 | MS
5 | Rehan | VISB01 | 23 | BS
6 | Rizwan | VRWP01 | 29 | MS
7 | Umer | VISB01 | 25 | BS
8 | Javed | VLHR01 | 26 | MS

You are required to completely de-normalize the above table using “horizontal splitting” on the basis of Degree Program.

Question No. 2
Consider the following normalized tables for a telecommunication company showing the daily call record details of customers:

Customer Info
Customer_ID | Customer Phone No. | Balance
1 | 033XXXXX | 300
2 | 033YYYYY | 250
3 | 033ZZZZZZ | 300
4 | 033AAAAA | 1000
5 | 033BBBBB | 80
6 | 033CCCCC | 554
… | … | …

Call record detail
Call_ID | Customer_ID | Dialed Phone Number | Duration | Call Charges
1 | 1 | 032ABCVD | 1 minute | 2 RS
2 | 1 | 032ABCVG | 2 minutes | 4 RS
3 | 1 | 032ABCVD | 1 minute | 2 RS
4 | 2 | 032ANNNN | 3 minutes | 6 RS
5 | 2 | 032AMMM | 4 minutes | 8 RS
6 | 3 | 033RRRRR | 1 minute | 2 RS
… | … | … | … | …

Due to certain performance factors, the company wants to de-normalize the tables using the pre-joining technique. Table information is given below:
• Assume a 1:4 record count ratio between Customer Info (master) and Call record detail (detail).
• Assume 15 million customers.
• Assume a 10 byte Customer_ID.
• Assume a 50 byte header for the Customer Info (master) table and an 80 byte header for the Call record detail (detail) table.

You are required to perform the following tasks:
• Calculate the total space in GBs used with normalization.
• Calculate the total space in GBs used after de-normalization.

Deadline: Your assignment must be uploaded on VULMS on or before June 17, 2020. June 18, 2020 will be a bonus day for assignment submission. After the bonus day, no assignment will be entertained via email.
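To help discuss Question No. 2, here is a rough sketch of the storage arithmetic for pre-joining. It is not an official solution and rests on several assumptions: the "header" sizes are treated as the row widths, the pre-joined row is taken as master width plus detail width minus one copy of Customer_ID, and 1 GB is taken as 10^9 bytes. All variable names are made up for this illustration.

```python
# Figures taken from the assignment text.
master_rows = 15_000_000          # customers
detail_rows = master_rows * 4     # 1:4 master-to-detail ratio
key_bytes = 10                    # Customer_ID
master_row_bytes = 50             # assumption: "header" means row width
detail_row_bytes = 80             # assumption: "header" means row width

# Normalized: both tables are stored separately.
normalized_bytes = master_rows * master_row_bytes + detail_rows * detail_row_bytes

# Pre-joined (assumption): every detail row carries the master columns,
# and the join key is stored only once per row.
prejoined_row_bytes = master_row_bytes + detail_row_bytes - key_bytes
denormalized_bytes = detail_rows * prejoined_row_bytes

BYTES_PER_GB = 10**9              # assumption: decimal gigabytes

print(f"Normalized:    {normalized_bytes / BYTES_PER_GB:.2f} GB")
print(f"De-normalized: {denormalized_bytes / BYTES_PER_GB:.2f} GB")
```

If your course material uses 2^30 bytes per GB, change `BYTES_PER_GB` and the GB figures shift accordingly. The byte totals stay the same.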
CS614 - Data Warehousing
CS614 Assignment 1 Solution and Discussion
M
Re: CS614 Assignment 1 Solution and Discussion Assignment No. 1 Semester: Spring 2020 CS614 – Data Warehousing Total Marks: 20 Due Date: June 01, 2020 Objectives: After completing this assignment, the students will be able to: • Identify Database entities from a given scenario • Understanding and designing system’s constraints from given scenario • Understand the database table structure • Normalize a database table up to 2nd Normal Form (2NF) Instructions Please read the following instructions carefully before submitting assignment: It should be clear that your assignment will not get any credit if: o Assignment is submitted after due date. o Submitted assignment does not open or file is corrupt. o Assignment is copied (From internet/ to from students). o Assignment is submitted other than word format (.doc, .docx). Assignment XYZ polyclinic is a well-reputed clinic providing good medical facilities in a posh area of Lahore. They also have a facility to admit a patient if the treatment requires. XYZ polyclinic was working manually for the last 10 years. You are hired by XYZ polyclinic management to design a database by reading the system’s requirements. Management of XYZ polyclinic decides to improve their system that would lead to convert the Database system to Data Warehouse for XYZ polyclinic in the future. Database Requirements: The organization is interested in storing name, patient’s CNIC, address, city, zip, province, country, phone number, referred by, admission date, and discharge date for storing patient’s information in the database. The referred by (would contain doctor’s id), admission date and discharge date attributes are only used in case of patient’s admission in a specific room. From the doctor’s point of view, they are interested to store doctor name, CNIC of doctor, address, city, zip, province, country, doctor’s phone number, area of specialization. 
We must store the treatment information that is suggested by the doctor for a specific patient which includes prescribed medicines. Medicine details must contain medicine id, name, dosage, and potency as attributes along with reference of treatment that is suggested by a doctor. If a patient would require getting admit in polyclinic then the room type (executive/common), phone extension and charges per day are going to be stored in the system as room attributes. Some additional constraints about the system are as under; One patient is referred to one/more doctor at a time. Multiple patients would be examined by the doctor in a day One patient may have multiple visits to a single doctor Room capacity/facilities for attendants would not be handled at this time One patient may use many medicines as suggested by doctor Every time visit to a doctor may result in a change of medicine or admitted to the polyclinic TASK to Perform and Submit: Identify relevant entities, primary keys, foreign keys, proper attributes and relations as per 2NF for the above scenario and provide database/schema/table-structure in MS-Word format that is Normalized up to 2nd Normal Form. Important Note: There’s no need to implement the solution using any DBMS Attribute Names should be clearly mentioned Deadline: Your assignment must be uploaded on VULMS on or before June 01, 2020. While June 02, 2020 will be a bonus day for assignment submission. After the bonus day, no assignment would be entertained via email.
CS614 - Data Warehousing
CS614 Assignment 3 Solution and Discussion
zareenZ
Assignment No. 03
Semester: Fall 2019
CS614: Data Warehousing
Total Marks: 15
Due Date: 27-Jan-2020

Objective: The objective of this assignment is to enhance the learning capabilities of the students about:
• Join Techniques
• Hash based join
• Sort-Merge join

Instructions:
Please read the following instructions carefully before submitting the assignment. You need to use an MS Word document to prepare and submit the assignment on VU-LMS. It should be clear that your assignment will not get any credit if:
• The assignment is submitted after the due date.
• The assignment is not in the required format (.doc or .docx).
• The submitted assignment does not open or the file is corrupt.
• The assignment is copied (partially or fully) from any source (websites, forums, students, etc.).

Assignment Question
HyperMart is an online shopping store currently acquiring a large number of customers. Normalized table structures for the shopping store are given below:

Order Table
Order_ID | Order_Date
100 | 2-Feb-2019
201 | 2-Feb-2019
300 | 4-Feb-2019
… | …

Order details Table
Detail_ID | Order_ID | Product_ID | Product_Quantity | Sale_Amount
100 | 100 | 2 | 1 | 100
101 | 100 | 3 | 2 | 200
200 | 201 | 5 | 1 | 50
201 | 201 | 7 | 1 | 90
300 | 300 | 15 | 1 | 600
301 | 300 | 56 | 1 | 800
302 | 300 | 57 | 1 | 850
… | … | … | … | …

Table Information
• Assume a 1:12 record count ratio between Order and Order details for the online store’s database.
• Assume 10,000 orders.

Task
For the given relations, you are required to calculate the following costs in terms of I/O operations and find out which joining technique is better for the given scenario:
• Cost of Sort-Merge Join
• Cost of Hash Join
On the basis of your calculations, suggest the better joining technique between sort-merge and hash join for the given scenario.

Best of Luck!
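As a starting point for discussion, here is a hedged sketch of the cost comparison. It is not an official solution. It assumes the simplified textbook cost formulas, sort-merge = M log2 M + N log2 N + (M + N) and hash join = 3 (M + N), and it treats record counts as the I/O units because the assignment gives no page size. Check both assumptions against your handouts. The function names are made up for this illustration.

```python
import math


def sort_merge_join_cost(m: int, n: int) -> float:
    """Assumed formula: sort both inputs, then one merge pass over each."""
    return m * math.log2(m) + n * math.log2(n) + (m + n)


def hash_join_cost(m: int, n: int) -> int:
    """Assumed formula: partition (read + write) both inputs, then probe."""
    return 3 * (m + n)


orders = 10_000                 # M, from the assignment
order_details = orders * 12     # N, from the 1:12 ratio

sm_cost = sort_merge_join_cost(orders, order_details)
hj_cost = hash_join_cost(orders, order_details)

print(f"Sort-merge join cost: {sm_cost:,.0f} I/Os")
print(f"Hash join cost:       {hj_cost:,} I/Os")
print("Cheaper technique:", "hash join" if hj_cost < sm_cost else "sort-merge join")
```

Under these assumptions the hash join comes out cheaper, because its cost grows linearly while the sort step adds a logarithmic factor.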
CS614 - Data Warehousing
CS614 GDB 1 Solution and Discussion
zareenZ
Total Marks: 5
Starting Date: Wednesday, January 15, 2020
Closing Date: Thursday, January 16, 2020
Status: Open
Question Title: GDB-CS614

Question Description

Scenario
Pulse Globe Energy Limited (PGEL) is involved in gas exploration, development and production activities in Australia and Asia. PGEL has investments in upstream gas activities and electricity generation that complement wholesale energy contracts to support the retail customer base. For one of its projects, PGEL has already acquired seismic data, which provides a “time picture” of subsurface structure to aid in gas exploration. You are hired by the company as a Data Analyst to identify the most desirable areas for gas exploration by using your analytical skills, as the company would have to invest a huge budget of 1 million dollars for this task.

After reading the above scenario, you are required to answer the following question:
Suggest the most suitable data mining technique for the given scenario and support your answer with one valid reason (no more than 2 – 3 lines).

The format of your answer should be as given below:
Technique Name: ________________________
1 Strong Reason to Choose: _______________________________________________________
CS614 - Data Warehousing
CS614 Assignment 2 Solution and Discussion
zareenZ
Assignment No. 02
Semester: Fall 2019
CS614: Data Warehousing
Total Marks: 10
Due Date: November 28, 2019

Objective: The objective of this assignment is to enhance the learning capabilities of the students about:
• De-Normalization
• Pre-Joining
• Storage Issues of Pre-joining

Instructions:
Please read the following instructions carefully before submitting the assignment. You need to use an MS Word document to prepare and submit the assignment on VU-LMS. It should be clear that your assignment will not get any credit if:
• The assignment is submitted after the due date.
• The assignment is not in the required format (.doc or .docx).
• The submitted assignment does not open or the file is corrupt.
• The assignment is copied (partially or fully) from any source (websites, forums, students, etc.).

Assignment Question
“Kare Pharma” is an online medical store currently acquiring a large number of customers. To manage some performance issues, this online medical store needs to de-normalize its database using the pre-joining technique for the Prescription and Prescription details tables. Normalized table structures are given below:

Prescription Table
Prescription_ID | Patient_Name | Doctor_Name | Prescription_Date
… | … | … | …

Prescription details Table
Transaction_ID | Prescription_ID | Med_ID | Med_Quantity | Sale_Amount
… | … | … | … | …

Table Information
• Assume a 1:11 record count ratio between the Prescription table (master) and the Prescription details table for the online medical store’s database.
• Assume 10 million records in the Prescription table.
• Assume 10 bytes reserved for Prescription_ID in memory.
• Assume a 40 byte header for the master table and a 70 byte header for the details table.

Task
You are required to perform the following tasks:
• Calculate the total space reserved in memory using normalization.
• Calculate the total space reserved in memory after de-normalization using the pre-joining technique.

Best of Luck!
CS614 - Data Warehousing
CS614 Assignment 1 Solution and Discussion
zareenZ
Assignment No. 1
Semester: Fall 2019
CS614 – Data Warehousing
Total Marks: 15
Due Date: November 14, 2019

Objectives: After completing this assignment the students will be able to:
• Identify database entities from a given scenario
• Understand the database table structure
• Normalize a database table up to 2nd normal form
• De-normalize relationships using the collapsing tables technique

Instructions
Please read the following instructions carefully before submitting the assignment. It should be clear that your assignment will not get any credit if:
o Assignment is submitted after the due date.
o Submitted assignment does not open or the file is corrupt.
o Assignment is copied (from the internet or from other students).
o Assignment is submitted in a format other than Word (.doc, .docx).

Assignment

Question No. 1
Consider the following schema related to a social media website, named ‘userPosts’. You have to perform the following tasks related to the provided schema:
1- Identify appropriate keys for the following structure (primary and/or foreign key(s))
2- Convert this schema into 2NF

userPosts (userID, userName, password, address, postId, postDate, postContent)

Question No. 2
Consider the following schemas relevant to a hotel booking website. You are required to de-normalize the given schemas using the Collapsing Tables technique.

roomVisitor (roomID, visitorCNIC, dateTime)
roomCharges (roomID, spentDays, roomRent)

Deadline: Your assignment must be uploaded on VULMS on or before November 14, 2019. November 15, 2019 will be a bonus day for assignment submission. After the bonus day, no assignment will be entertained via email.
CS614 - Data Warehousing
CS614 Assignment No.3 Solution and Discussion
zaasmiZ
CS614 - Data Warehousing
CS614 GDB Solution and Discussion
zaasmiZ
Total Marks: 5
Starting Date: Monday, July 15, 2019
Closing Date: Tuesday, July 16, 2019
Status: Open
Question Title: GDB - CS614

Question Description

Scenario
“Choice Restaurant” is a chain of restaurants with five branches across the country. The restaurant typically deals in fast food and has 40 to 100 customers a day in each branch. The management of this company has introduced an Online Transaction Processing (OLTP) system to handle the daily transactions. The management is very concerned about cutting costs and attaining customer satisfaction. Therefore, there is currently only one computer system per branch, which handles all the computing tasks (including transaction processing), but in future they plan to introduce a DSS. To speed up online transaction processing, the database manager is urging the management to implement parallelism for this OLTP system during business hours for daily sales calculations. The management has yet to decide whether they should invest in parallelism or not.

GDB Question:
Keeping the above scenario in mind, do you think that implementing parallelism in the above situation is a good option? Justify your answer with appropriate reason(s) in either case.

GDB Answer Template:
Choice: Parallelism used or not
Points of Justification:
Point 1:
Point 2:

Important Notes:
• No GDB is accepted via e-mail in any case.
• Lengthy GDB replies will result in a deduction of marks, so write your answer precisely within 3 to 5 lines in 2 points along with your choice.
• If you do not mention your choice at the start of your answer, your GDB points will not be marked.
CS614 - Data Warehousing
How does aggregate awareness help the users?
zaasmiZ
Can anybody describe this?
CS614 - Data Warehousing
Normalization affects performance?
zaasmiZ
True False
CS614 - Data Warehousing
CS614 Mid Term Past Paper and Please share your current Paper
zaasmiZ
cs614-Mid term Solved MCQs With References by moaaz.pdf cs614-Mid term Solved Subjectives With References by moaaz.pdf
CS614 - Data Warehousing
CS614 Quiz No. 1 Solution and Discussion
zaasmiZ
CS614 - Data Warehousing
CS614 Assignment No.2 Solution and Discussion
zaasmiZ
CS614 - Data Warehousing
Assignment No. 1
 Semester: Spring 2019 CS614 – Data Warehousing
cyberianC
Spring 2019_CS614_1.doc Assignment No. 1
Semester: Spring 2019 CS614 – Data Warehousing Total Marks: 20 Due Date: May 15, 2019 Objectives: After completing this assignment the students will be able to: • Identify Database entities from a given scenario • Understand the database table structure • Normalize a database table up to 3rd normal form Instructions Please read the following instructions carefully before submitting assignment: It should be clear that your assignment will not get any credit if: o Assignment is submitted after due date. o Submitted assignment does not open or file is corrupt. o Assignment is copied (From internet/ to from students). o Assignment is submitted other than word format (.doc, .docx). Assignment XYZ company was established in 1989 in Karachi and deals in the business of garments export. There are 250 permanent employees that are working with XYZ company. XYZ’s admin department is maintaining employee’s leave records in hard form. The leave form currently in practice is shown below: Now XYZ company wants you to design a Database solution for their organization, you are required to develop a database that is normalized up to the 3rd Normal form. In the future, this database can be used for developing front end application. A raw table structure along with attributes are provided by the XYZ’s management to start your working on it, is as following; employeeLeaveForm (employeeID, empName, designation, officeName, dateOfFill, leaveType, leaveFromDate, leaveToDate, leaveReason, leaveAddress, leaveContactNo, leaveProcessDate, leaveStatus, leaveProcessedBy), Sample data for the table is provided on VULMS for your reference. To view / download this data, use the following link; https://vulms.vu.edu.pk/Courses/CS614/Downloads/CS614 - SP 2019 - Assignment 1 - Sample Data.xlsx You have to use exact given attributes names while normalizing this table to 3rd Normal form. Instructions to solve assignment: Use table employeeLeaveForm and attributes in it. 
You are not allowed to remove any of the provided attributes You may introduce new tables by using the same attributes in employeeLeaveForm and also add new attributes for indexing / normalization purpose You are also required to declare Primary and Foreign Keys from the provided list of attributes (Also represent them with suitable DB notation while writing table schema) Transform the table in 3rd normal form step by step i.e. First Normal form transformation then into 2nd normal form and then into 3rd. Also, mention the title of Normal Form before the transformation. Direct transform relation into 3rd normal form would result in zero marks whether your solution is correct. And this will be imposed strictly while marking this assignment with no later excuses. Example of resultant tables / solution should be in the form provided below: tableNo1 (attribute1, attribute2, …) tableNo2 (attributeX, attributeY, …) tableNo3 (attributeI, attributeX, …) attribute1, attributeX and attributeI (with solid underline) are primary keys in tableNo1, tableNo2 and tableNo3 respectively while attribute (With dotted underline) is foreign key in tableNo3 from referee tableNo2. Deadline: Your assignment must be uploaded on VULMS on or before May 15, 2019. While May 16, 2019 will be a bonus day for assignment submission. After the bonus day, no assignment would be entertained via email.
CS614 - Data Warehousing

CS614 Quiz No. 2 Solution and Discussion

  • Khuram Shahzad wrote (post #18):

    Quiz Start Time: 04:09 PM

    Question # 1 of 10 ( Start time: 04:09:48 PM ) Total Marks: 1
    In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’ rows in inner table, time complexity is
    Select correct option:

    O(MN) (Correct)
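To see where O(MN) comes from, here is a small sketch of a nested-loop join that counts comparisons. The sample rows and the function name `nested_loop_join` are made up for this illustration, and real optimizers work on pages and indexes rather than Python lists.

```python
def nested_loop_join(outer, inner, outer_key, inner_key):
    """Compare every outer row with every inner row and keep the matches."""
    matches = []
    comparisons = 0
    for outer_row in outer:            # M iterations
        for inner_row in inner:        # N iterations for each outer row
            comparisons += 1
            if outer_row[outer_key] == inner_row[inner_key]:
                matches.append({**outer_row, **inner_row})
    return matches, comparisons


# Hypothetical sample data: 3 orders (M = 3) and 7 order details (N = 7).
orders = [{"order_id": 100}, {"order_id": 201}, {"order_id": 300}]
details = [
    {"detail_id": d, "order_id": o}
    for d, o in [(100, 100), (101, 100), (200, 201), (201, 201),
                 (300, 300), (301, 300), (302, 300)]
]

matches, comparisons = nested_loop_join(orders, details, "order_id", "order_id")
swapped, swapped_comparisons = nested_loop_join(details, orders, "order_id", "order_id")

print(comparisons, len(matches), len(swapped))  # prints: 21 7 7
```

The comparison count is M × N = 21 whichever table is the outer one, and the number of matching rows (7) does not depend on the order of the tables either.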

    Question # 2 of 10 ( Start time: 04:11:20 PM ) Total Marks: 1
    In context of data mining definition, the term “value” means:
    Select correct option:

    Importance of hidden patterns discovered (Correct)

    Question # 3 of 10 ( Start time: 04:12:51 PM ) Total Marks: 1
    Classification consists of examining the properties of a newly presented observation and assigning it to a predefined ____________.
    Select correct option:

    Class (Correct)

    Question # 4 of 10 ( Start time: 04:13:44 PM ) Total Marks: 1
    The optimizer uses a hash join to join two tables if they are joined using an equijoin and
    Select correct option:

    Large amount of data needs to be joined (Correct)
    A large amount of data needs to be joined.
    A large portion of the table needs to be joined.
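For comparison with the nested-loop case, here is a minimal in-memory sketch of a hash join. It only works for equijoins, because rows are matched by looking up an equal key in a hash table. The function name `hash_join` and the sample rows are made up for this illustration, and a real optimizer would also partition data that does not fit in memory.

```python
from collections import defaultdict


def hash_join(build_rows, probe_rows, key):
    """Equijoin: hash the smaller input once, then probe it with the larger one."""
    buckets = defaultdict(list)
    for row in build_rows:                 # build phase: one pass over the small table
        buckets[row[key]].append(row)

    joined = []
    for row in probe_rows:                 # probe phase: one pass over the large table
        for match in buckets.get(row[key], []):
            joined.append({**match, **row})
    return joined


# Hypothetical sample data: a small master table and a larger detail table.
orders = [
    {"order_id": 100, "order_date": "2-Feb-2019"},
    {"order_id": 201, "order_date": "2-Feb-2019"},
    {"order_id": 300, "order_date": "4-Feb-2019"},
]
details = [
    {"detail_id": d, "order_id": o}
    for d, o in [(100, 100), (101, 100), (200, 201), (201, 201),
                 (300, 300), (301, 300), (302, 300)]
]

joined = hash_join(orders, details, "order_id")
print(len(joined))  # prints: 7
```

Each table is read once, so the work grows with M + N instead of M × N, which is why a hash join pays off when a large amount of data has to be joined.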

    Question # 5 of 10 ( Start time: 04:15:07 PM ) Total Marks: 1
    In context of nested-loop join, actual number of matching rows returned as a result of the join would be ______ of the order of tables
    Select correct option:

    Independent (Correct)

    Question # 6 of 10 ( Start time: 04:16:36 PM ) Total Marks: 1
    Normally the input data structure (a database table) for a data mining algorithm:
    Select correct option:

    Question # 7 of 10 ( Start time: 04:18:07 PM ) Total Marks: 1
    Mining multi dimensional databases allow users to:
    Select correct option:

    Analyze Data (Correct)

    Question # 8 of 10 ( Start time: 04:19:38 PM ) Total Marks: 1
    ________ refers to the overall process of discovering useful knowledge from data and data mining refers to a particular step in this process.
    Select correct option:

    Knowledge discovery in database (Correct)

    Question # 9 of 10 ( Start time: 04:21:01 PM ) Total Marks: 1
    In case of nested-loop join, inner table is accessed _____ for each qualifying row (or tuple) in outer table
    Select correct option:

    One Time (Correct)

    Question # 10 of 10 ( Start time: 04:22:32 PM ) Total Marks: 1
    In contrast to data mining, statistics is ______ driven.
    Assumption (Correct)

    Quiz Start Time: 08:50 PM

    Question # 1 of 10 ( Start time: 08:50:34 PM ) Total Marks: 1
    In context of data mining definition, the term “value” means:
    Select correct option:

    The primary key of table
    The index location of the record
    Importance of hidden patterns discovered (Answer)
    Numerical or string measure assigned to an attribute

    Question # 2 of 10 ( Start time: 08:51:23 PM ) Total Marks: 1
    Data mining is all about:
    Select correct option:

    Knowledge discovery in database
    Finding hidden patterns in data
    Finding relationships in data
    All of the given options (may be Answer)

    Question # 3 of 10 ( Start time: 08:52:41 PM ) Total Marks: 1
    In contrast to statistics, data mining is ______ driven.
    Select correct option:

    Assumption
    Knowledge (Answer)
    Human
    Database

    Question # 4 of 10 ( Start time: 08:53:56 PM ) Total Marks: 1
    Mining multi dimensional databases allow users to:
    Select correct option:

    Categorize the data
    Analyze the data (Answer)
    Summarize the data
    All of the given options

    Question # 5 of 10 ( Start time: 08:54:40 PM ) Total Marks: 1
    In context of data mining definition, the term “nontrivial” means:
    Select correct option:

    Discovering information is a simple task
    Discovering information is a complex task
    We can not discover information
    We simply find things rather than discovery (Answer)

    Question # 6 of 10 ( Start time: 08:55:56 PM ) Total Marks: 1
    Identify the TRUE statement:
    Select correct option:

    Clustering is unsupervised learning and classification is supervised learning (Answer)
    Clustering is supervised learning and classification is unsupervised learning
    Both clustering and classification are unsupervised learning
    Both clustering and classification are supervised learning

    Question # 7 of 10 ( Start time: 08:57:26 PM ) Total Marks: 1
    In ________learning you don’t know the number of clusters and no idea about their attributes.
    Select correct option:

    Supervised learning
    Unsupervised learning (Answer)
    Multi Dimension modeling
    None of the given options

    Question # 8 of 10 ( Start time: 08:58:39 PM ) Total Marks: 1
    In context of clustering, the term “distance” means:
    Select correct option:

    Similarity/dissimilarity of records (Answer)
    The difference between the primary keys of two records
    The relation of a record with corresponding record in child table
    None of the given options
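As a small illustration of "distance" as dissimilarity between records, here is a sketch using Euclidean distance. This is just one common choice, and clustering algorithms can use other measures. The records and the function name `euclidean_distance` are made up for this example.

```python
import math


def euclidean_distance(record_a, record_b):
    """Dissimilarity of two numeric records: smaller means more similar."""
    if len(record_a) != len(record_b):
        raise ValueError("records must have the same number of attributes")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(record_a, record_b)))


# Hypothetical records described by two numeric attributes each.
record_1 = (27, 3.0)
record_2 = (24, 7.0)
record_3 = (27, 3.5)

print(euclidean_distance(record_1, record_2))  # prints: 5.0
print(euclidean_distance(record_1, record_3))  # prints: 0.5
```

Here `record_1` is much closer to `record_3` than to `record_2`, so a clustering algorithm would be more likely to group the first and third records together.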

    Question # 9 of 10 ( Start time: 08:59:22 PM ) Total Marks: 1
    In data mining, initially you _____ what you are looking for.
    Select correct option:

    Know
    Don’t know (Answer)
    May or may not know
    None of the given options

    Question # 10 of 10 ( Start time: 09:00:41 PM ) Total Marks: 1
    The optimizer uses a hash join to join two tables if they are joined using an equijoin and
    Select correct option:

    Outer table has less number of rows
    Inner table has less number of rows
    Cardinality of tables is equal
    Large amount of data needs to be joined (Answer)

    • zareen (Cyberian's Gold) wrote (post #19):

      In context of the most fundamental data warehouse life cycle model, which of the following is NOT one of the data warehouse design activities?
      Select correct option:
      End-user interviews and re-interviews
      Source system cataloguing
      Definition of key performance indicators
      System vision development

      Discussion is the right way to get the solution to every assignment, quiz and GDB.
      We are always here to discuss and guide. Please don't visit Cyberian only for solutions.
      The Cyberian Team is always happy to help by providing an idea solution. Please don't hesitate to contact us!
      NOTE: Don't copy or replicate idea solutions.

      • zareen (Cyberian's Gold) wrote (post #20):

        Vertically wide data means:

        • zareenZ Offline
          zareenZ Offline
          zareen
          Cyberian's Gold
          wrote on last edited by
          #21

          In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that:
          24833b95-8872-4181-91e6-a03d1854a8dc-image.png

          • zareenZ Offline
            zareenZ Offline
            zareen
            Cyberian's Gold
            wrote on last edited by
            #22

            Data mining is all about:

            • zareenZ Offline
              zareenZ Offline
              zareen
              Cyberian's Gold
              wrote on last edited by
              #23

              “If resources increase in proportion to increase in data size, time is constant”. The statement refers to:

              9c6926fd-f29e-42b2-993a-f1b9ad44643e-image.png
              8e9d55a5-335f-4d45-8fee-d14cb6975ce2-image.png

              • zareenZ Offline
                zareenZ Offline
                zareen
                Cyberian's Gold
                wrote on last edited by
                #24

                An effective user education program includes, among others, the following guideline(s):

                bdbe9a2f-8438-4cad-a916-12f573d62471-image.png

                • zareenZ Offline
                  zareenZ Offline
                  zareen
                  Cyberian's Gold
                  wrote on last edited by
                  #25

                  Implementation of a data warehouse requires ________ activities

                  Highly integrated

                  Loosely integrated

                  Tightly decoupled

                  None of the given

                  • zareenZ Offline
                    zareenZ Offline
                    zareen
                    Cyberian's Gold
                    wrote on last edited by
                    #26

                    Which of the following is NOT one of the three parallel tracks in Kimball’s approach?

                    • zareenZ Offline
                      zareenZ Offline
                      zareen
                      Cyberian's Gold
                      wrote on last edited by
                      #27

                      Which of the following is NOT one of the methodologies for Data Warehouse project development?

                      System Driven

                      • zareenZ Offline
                        zareenZ Offline
                        zareen
                        Cyberian's Gold
                        wrote on last edited by
                        #28

                        In contrast to data mining, statistics is ______ driven. CS614
                        073574bf-d5f3-42b7-b52b-37c8a4a3de3d-image.png

                        • zareenZ Offline
                          zareenZ Offline
                          zareen
                          Cyberian's Gold
                          wrote on last edited by
                          #29

                          Quiz Start Time: 04:09 PM

                          Question # 1 of 10 ( Start time: 04:09:48 PM ) Total Marks: 1
                          In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’ rows in inner table, time complexity is
                          Select correct option:

                          O(MN) (Correct)

                          Question # 2 of 10 ( Start time: 04:11:20 PM ) Total Marks: 1
                          In context of data mining definition, the term “value” means:
                          Select correct option:

                          Importance of hidden patterns discovered (Correct)

                          Question # 3 of 10 ( Start time: 04:12:51 PM ) Total Marks: 1
                          Classification consists of examining the properties of a newly presented observation and assigning it to a predefined ____________.
                          Select correct option:

                          Class (Correct)

                          Question # 4 of 10 ( Start time: 04:13:44 PM ) Total Marks: 1
                          The optimizer uses a hash join to join two tables if they are joined using an equijoin and
                          Select correct option:

                          Large amount of data needs to be joined (Correct)
                          For reference, either of these conditions applies:
                          A large amount of data needs to be joined.
                          A large portion of the table needs to be joined.

                          Question # 5 of 10 ( Start time: 04:15:07 PM ) Total Marks: 1
                          In context of nested-loop join, the actual number of matching rows returned as a result of the join would be ______ of the order of tables
                          Select correct option:

                          Independent (Correct)

                          Question # 6 of 10 ( Start time: 04:16:36 PM ) Total Marks: 1
                          Normally the input data structure (a database table) for a data mining algorithm:
                          Select correct option:

                          Question # 7 of 10 ( Start time: 04:18:07 PM ) Total Marks: 1
                          Mining multidimensional databases allows users to:
                          Select correct option:

                          Analyze the data (Correct)

                          Question # 8 of 10 ( Start time: 04:19:38 PM ) Total Marks: 1
                          ________ refers to the overall process of discovering useful knowledge from data and data mining refers to a particular step in this process.
                          Select correct option:

                          Knowledge discovery in database (Correct)

                          Question # 9 of 10 ( Start time: 04:21:01 PM ) Total Marks: 1
                          In case of nested-loop join, the inner table is accessed _____ for each qualifying row (or tuple) in the outer table
                          Select correct option:

                          One time (Correct)

                          Question # 10 of 10 ( Start time: 04:22:32 PM ) Total Marks: 1
                          In contrast to data mining, statistics is ______ driven.
                          Assumption (Correct)

                          • zareenZ Offline
                            zareenZ Offline
                            zareen
                            Cyberian's Gold
                            wrote on last edited by
                            #30

                            1_ As per Bill Inmon, a data warehouse, in contrast with classical applications, is:

                            Data driven _ pg285

                            2_ Which of the following is NOT one of the three parallel tracks in Kimball’s approach?
                            Lifecycle Maintenance track

                            3_ Bill Inmon argues that requirements are well understood only after
                            Data warehouse is populated _ pg285

                            4_ Goal driven approach of data warehouse development was result of ______ work
                            Böhnlein and Ulbrich-vom _ pg285

                            5_ Identify the TRUE statement:
                            Clustering is unsupervised learning and classification is supervised learning _ pg 270

                            6_ Normally the term “DWH face to the business user” refers to:
                            Lifecycle Analytical Applications track _ pg 306

                            7_ In ________learning you don’t know the number of clusters and no idea about their attributes.
                            Unsupervised learning
                            https://www.cs.uic.edu/~liub/teach/cs583-fall-05/CS583-unsupervised-learning.ppt

                            8_ Waterfall model is appropriate when
                            Requirements are clearly defined _ pg 284

                            9_ Implementation of a data warehouse requires ________ activities.
                            none of above

                            10_ Normally the input data structure (a database table) for a data mining algorithm:
                            Has more number of records than attributes (not sure)

                            • zareenZ zareen

                              1_ In context of data parallelism, the work done by query processor should be:

                              Maximum

                              2_ _______ do not (typically) keep the index values in sorted order

                              Hash based index

                              3_ If every key in the data file is represented in the index file, then it is called

                              Dense index

                              4_ In context of bitmap index, the length of the bit vector is:

                              the number of records in the base table

                              5_ In context of joining tables, the join condition is specified in _____ clause.

                              WHERE

                              6_ A join is identified by multiple tables in the _____ clause.

                              From

                              7_ Parallelism can be exploited, if there is:

                              All of the given options

                              8_ In ____ index, the ith bit is set to “1” if the ith row of the base table has the value for the indexed column.

                              Bitmap index

                              9_ As the number of processors increases, the speedup should also increase, so we should have linear speedup. Which of the following is NOT one of the barriers to achieving this linear speed-up?

                              Amdahl’s Law (not sure)

                              10_ Bitmap index is appropriate for:

                              Low cardinality data

                              Q1: In context of nested-loop join, the actual number of matching rows returned as a result of the join would be ________ of the order of tables.
                              Independent.

                              Q2: Which of the following is NOT one of the parallel hardware architectures?
                              Shared Memory

                              Q3: “If resources increase in proportion to increase in data size, time is constant”. The statement refers to:
                              Scale-Up

                              Q4: If every key in the data file is represented in the index file then it is called?
                              Dense Index

                              Q5: In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that.
                              All

                              Q6: In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’ rows in inner table, time complexity is:
                              O(MN)
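To see where the M × N comes from (and why the number of matching rows does not depend on which table is the outer one), here is a naive sketch. The function name and the sample lists are made up for illustration; real engines use index or block variants.

```python
def nested_loop_join(outer, inner, predicate):
    """Naive nested-loop join; returns (matches, number_of_comparisons)."""
    matches = []
    comparisons = 0
    for o in outer:          # M iterations
        for i in inner:      # one full pass over the inner table per outer row
            comparisons += 1
            if predicate(o, i):
                matches.append((o, i))
    return matches, comparisons

# Hypothetical key columns of two small tables.
a = [1, 2, 3, 4]   # M = 4
b = [2, 4, 6]      # N = 3

ab, cost_ab = nested_loop_join(a, b, lambda x, y: x == y)
ba, cost_ba = nested_loop_join(b, a, lambda x, y: x == y)
```

In this naive version both orders make 4 × 3 comparisons and both return the same two matching pairs.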

                              Q7: The goal of __________ is to look at as few blocks as possible to find the matching record(s).
                              Indexing

                              Q8: Parallelism can be exploited, if there is.
                              All of the given options

                              Q9: If we apply Run Length Encoding on the input “11001100”, the output will be.
                              21#20#21#20
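The answer format above appears to be "run length followed by the symbol", with runs separated by "#". A small sketch under that assumption (the function name and the `sep` parameter are mine, not from the handouts):

```python
from itertools import groupby

def run_length_encode(bits, sep="#"):
    """Encode each run as '<run length><symbol>' and join the runs with sep."""
    return sep.join(f"{len(list(run))}{symbol}" for symbol, run in groupby(bits))

encoded = run_length_encode("11001100")
print(encoded)  # 21#20#21#20
```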

                              Q10: Which of the following is NOT one of the variants of Nested-loop join?
                              Binary index nested-loop join.

                              Q11: In context of data parallelism, the work done by query processor should be:
                              Maximum.

                              Q12: ___________ do not (typically) keep the index values in sorted order
                              Hash based Index

                              Q13: If every key in the data file is represented in the index file, then it is called:
                              Dense Index

                              Q14: In context of bitmap index, the length of the bit vector is:
                              The number of records in the base table

                              Q15: In context of joining tables, the join condition is specified in ______ clause:
                              Where

                              Q16: A join is identified by multiple tables in the________ clause.
                              From

                              Q17: Parallelism can be exploited, if there is:
                              All of the given options

                              Q18: In ________ index, the ith bit is set to “1” if the ith row of the base table has the value for the index column
                              Bitmap index

                              Q19: As the number of processors increase, the speedup should also increase. Thus we should have linear speedup. Which of the following is NOT one of the barriers to achieve this linear speed-up?
                              Amdahl’s Law

                              Q20: Bitmap index is appropriate for:
                              Low cardinality data
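A rough sketch of why bitmap indexes fit low cardinality columns: there is one bit vector per distinct value, and every vector has as many bits as the base table has rows. The `gender` column and the function name are hypothetical.

```python
def build_bitmap_index(column):
    """Map each distinct value to a bit vector with one bit per row."""
    index = {value: [0] * len(column) for value in set(column)}
    for row_number, value in enumerate(column):
        # The i-th bit is 1 when the i-th row holds this value.
        index[value][row_number] = 1
    return index

# Hypothetical low cardinality column with 5 rows and 2 distinct values.
gender = ["M", "F", "F", "M", "F"]
bitmaps = build_bitmap_index(gender)
```

With only two distinct values the index needs two vectors of 5 bits each; a high cardinality column would need one vector per value, which is why it is a poor fit.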

                              Q21: If a task takes “T” time units to execute on a single data item, then execution of the task on “N” data items will take______ time units?
                              N*T

                              Q22: _________ lists each term in the collection only once and then shows a list of all the documents that contain the given term.
                              Inverted index
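A minimal sketch of an inverted index, assuming whitespace tokenization and made-up document ids (neither comes from the handouts):

```python
from collections import defaultdict

def build_inverted_index(documents):
    """Map each term to the sorted list of document ids that contain it."""
    postings = defaultdict(set)
    for doc_id, text in documents.items():
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}

# Hypothetical mini collection.
docs = {1: "data warehouse design", 2: "data mining", 3: "warehouse indexing"}
inverted = build_inverted_index(docs)
```

Each term appears once as a key, and its value lists every document containing it.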

                              Q23: “More resources means proportionally less time for given amount of data”. The statement refers to:
                              Speed-UP
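Since Amdahl's Law keeps coming up in these quizzes, here is a small sketch of the usual formula, speed-up = 1 / ((1 - f) + f / N), where f is the fraction of work that can run in parallel and N is the number of processors. The function name and sample values are mine.

```python
def amdahl_speedup(parallel_fraction, processors):
    """Speed-up predicted by Amdahl's Law: 1 / ((1 - f) + f / N)."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / processors)

# Fully parallel work gives linear speed-up on 4 processors.
ideal = amdahl_speedup(1.0, 4)
# With 20% serial work, 4 processors give only about 2.5x.
limited = amdahl_speedup(0.8, 4)
```

The serial part is what stops speed-up from growing linearly with the number of processors.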

                              Q24: In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that:
                              All of the given option

                              Q25: In context of bitmap index, the length of the bit vector is
                              the number of records in the base table.

                              Q26: One of the preconditions to decide about operations to be parallelized is that:
                              Operation can be implemented independent of each other

                              Q27: A _________ index, if it fits in memory, costs only one disk I/O access to locate a record given a key.
                              Dense Index

                              Q28: In context of nested-loop join, actual number of matching rows returned as a result of the join would be ___ of the order of tables
                              Independent

                              Q29: __________ refers to “Parallel execution of a single data operation across multiple partitions of data”
                              Data Parallelism.
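A sketch of the idea: the same operation runs on every partition and the partial results are combined. Threads are used here only to show the structure (in CPython they will not give a real CPU speed-up); the round-robin partitioning and the function names are assumptions for the example.

```python
from concurrent.futures import ThreadPoolExecutor

def partition(rows, n):
    """Split rows into n round-robin partitions."""
    return [rows[i::n] for i in range(n)]

def parallel_sum(rows, n_partitions=4):
    """Run the same operation (sum) on every partition, then combine."""
    parts = partition(rows, n_partitions)
    with ThreadPoolExecutor(max_workers=n_partitions) as pool:
        partial_sums = list(pool.map(sum, parts))
    return sum(partial_sums)

total = parallel_sum(list(range(1, 101)))
```

Evenly sized partitions matter here: if one partition is much larger than the others, the whole query waits for it.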

                              A join is identified by multiple tables in the _ FROM ___ clause

                              In context of joining tables, the join condition is specified in _ WHERE ___ clause

                              The goal of __ Indexing __ is to look at as few blocks as possible to find the matching record(s).

                              __ Sparse __ index uses even less space than __ dense __ index, but the block has to be searched, even for unsuccessful searches.

                              In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that:

                              If we apply Run Length Encoding on the input “11001100”, the output will be:

                              In B-tree index, the lowest level index blocks are called leaf blocks, and these blocks contain:

                              every indexed data value and a corresponding ROWID

                              __ Sparse __ index stores the first value in each block in the sequential file and a pointer to the block
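To compare the two index types from the statements above, here is a toy sketch where a sorted file is modelled as a list of blocks. The block contents and names are invented for illustration.

```python
from bisect import bisect_right

# Hypothetical sorted sequential file, three records per block.
blocks = [[10, 20, 30], [40, 50, 60], [70, 80, 90]]

# Dense index: one entry per key, pointing at its block.
dense = {key: b for b, block in enumerate(blocks) for key in block}

# Sparse index: only the first key of each block.
sparse_keys = [block[0] for block in blocks]

def sparse_lookup(key):
    """Return (block_number, found); the block is scanned even on a miss."""
    b = bisect_right(sparse_keys, key) - 1
    if b < 0:
        return None, False
    return b, key in blocks[b]
```

The sparse index holds 3 entries against 9 for the dense one, but a search for a missing key such as 55 still has to read block 1 before it can report a miss.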

                              1_ In context of data parallelism, the work done by query processor should be:

                              Maximum

                              2_ _______ do not (typically) keep the index values in stored order

                              Hash based index

                              3_ if every key in the data is represented in the index file then it is called

                              Dense index

                              4_ In context of bitmap index, the length of the bit vector is:

                              the number of records in the base table

                              5_ In context of joining tables, the join condition is specified in _____ clause.

                              WHERE

                              6_ A join is identified by multiple tables in the _____ clause.

                              From

                              7_ Parallelism can be exploited, if there is:

                              All of the given options

                              8_ In ____ index, the ith bit is set to “1” if the ith row of the base table has the value for the indexed column.

                              Bitmap index

                              9_ As the number of processors increase, the speedup should also increase. thus we should have linear speedup. Which of the following is NOT the one of the barriers

                              to achieve this linear speed-up?

                              Amdah’l Law not sure

                              10_ Bitmap index is appropriate for:

                              Low cardinality data

                              Q1: in context of nested-loop join, actual number os matching rows returned as a result of the join would be ________ of the order of tables.
                              Independent.

                              Q2: Which of the following is NOT one of the parallel hardware architecture?
                              Shared Memory

                              Q3: If resources increase in proportion to increase in data size. time is constant’. The statement refers to:
                              Scale-Up

                              Q4: If every key in the data file is represented in the index file then it is called?
                              Dense Index

                              Q5: In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that.
                              All

                              Q6: In nested-loop join case, if there are ‘M’ rows in outer table and N rows in inner table, time complexity is.
                              o(MN)

                              Q7: The goal of__________ is to look at as few blocks as possib le to find the matching records(s).
                              Indexing

                              Q8: Parallelism can be exploited, if there is.
                              All of the given options

                              Q9: If we apply Run Length Encoding on the input “11001100”, the output will be.
                              21#20#21#20
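For readers who want to check this answer, here is a minimal Python sketch of run-length encoding. The output format (run length followed by the symbol, with runs joined by "#") is inferred from the answer above, and the function name `rle_encode` is hypothetical.

```python
from itertools import groupby

def rle_encode(bits: str) -> str:
    """Encode each run as <run length><symbol>, joining runs with '#'."""
    return "#".join(f"{len(list(run))}{symbol}" for symbol, run in groupby(bits))

print(rle_encode("11001100"))  # 21#20#21#20
```

The input has four runs of length two, which is why the encoded form here is not shorter than the original; run-length encoding pays off when runs are long.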

                              Q10: Which of the following is NOT one of the variants of Nested-loop join?
                              Binary index nested-loop join.

                              Q11: In context of data parallelism, the work done by query processor should be:
                              Maximum.

                              Q12: ___________ do not (typically) keep the index values in sorted order
                              Hash based Index

                              Q13: If every key in the data file is represented in the index file then it is called:
                              Dense Index

                              Q14: In context of bitmap index, the length of the bit vector is:
                              The number of records in the base table

                              Q15: In context of joining tables, the join condition is specified in ______ clause:
                              Where

                              Q16: A join is identified by multiple tables in the________ clause.
                              From

                              Q17: Parallelism can be exploited, if there is
                              All of the given options

                              Q18: In ________ index, the ith bit is set to “1” if the ith row of the base table has the value for the index column
                              Bitmap index
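As a rough illustration of the bitmap index described in Q14 and Q18, the sketch below builds one bit vector per distinct value. The column data and the helper name `build_bitmap_index` are made up for the example.

```python
def build_bitmap_index(column):
    """Return {value: bit vector}; bit i is 1 if row i holds that value."""
    index = {}
    for i, value in enumerate(column):
        # Every vector is as long as the base table (one bit per row).
        bits = index.setdefault(value, [0] * len(column))
        bits[i] = 1
    return index

gender = ["M", "F", "F", "M", "F"]  # hypothetical low-cardinality column
index = build_bitmap_index(gender)
print(index["F"])  # [0, 1, 1, 0, 1]
```

One vector is needed per distinct value, which is why this structure suits low cardinality data.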

                              Q19: As the number of processors increase, the speedup should also increase. Thus we should have linear speedup. Which of the following is NOT one of the barriers to achieve this linear speed-up?
                              Amdahl's Law

                              Q20: Bitmap index is appropriate for:
                              Low cardinality data

                              Q21: If a task takes “T” time units to execute on a single data item, then execution of the task on “N” data items will take______ time units?
                              N*T

                              Q22: _________ lists each term in the collection only once and then shows a list of all the documents that contain the given term.
                              Inverted index
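A small sketch of the inverted index described in Q22, assuming whitespace-separated terms. The sample documents and the name `build_inverted_index` are hypothetical.

```python
from collections import defaultdict

def build_inverted_index(docs):
    """Map each term to the sorted list of document ids that contain it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}

docs = {1: "data warehouse design", 2: "data mining", 3: "warehouse indexing"}
inverted = build_inverted_index(docs)
print(inverted["data"])       # [1, 2]
print(inverted["warehouse"])  # [1, 3]
```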

                              Q23: “More resources means proportionally less time for given amount of data”. The statement refers to:
                              Speed-UP
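Speed-up is usually quantified as the ratio of serial time to parallel time. As a rough sketch, the snippet below computes that ratio and the bound given by Amdahl's Law, where `f` is the fraction of the work that can run in parallel and `n` is the number of processors. The function names and sample numbers are illustrative assumptions, not taken from the quiz.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Ratio of serial execution time to parallel execution time."""
    return t_serial / t_parallel

def amdahl_speedup(f: float, n: int) -> float:
    """Amdahl's Law: the serial fraction (1 - f) limits the overall speed-up."""
    return 1.0 / ((1.0 - f) + f / n)

print(speedup(100.0, 25.0))              # 4.0
print(amdahl_speedup(1.0, 4))            # 4.0 (fully parallel work scales linearly)
print(round(amdahl_speedup(0.5, 2), 3))  # 1.333
```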

                              Q24: In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that:
                              All of the given options

                              Q25: In context of bitmap index, the length of the bit vector is
                              the number of records in the base table.

                              Q26: One of the preconditions to decide about operations to be parallelized is that:
                              Operation can be implemented independent of each other

                              Q27: A_________ index, if fits in the memory, costs only one disk I/O access to locate a record given a key.
                              Dense Index

                              Q28: In context of nested-loop join, actual number of matching rows returned as a result of the join would be ___ of the order of tables
                              Independent

                              Q29: __________ refers to "Parallel execution of single data operation across multiple partitions of data"
                              Data Parallelism.

                              A join is identified by multiple tables in the _ FROM ___ clause

                              In context of joining tables, the join condition is specified in _ WHERE ___ clause

                              The goal of __ Indexing __ is to look at as few blocks as possible to find the matching record(s).

                              __ Sparse Index _____ index uses even less space than __ dense ____ index, but the block has to be searched, even for unsuccessful searches.

                              In context of data parallelism, to get a speed-up of N with N partitions, it must be ensured that:

                              If we apply Run Length Encoding on the input “11001100”, the output will be:

                              In B-tree index, the lowest level index blocks are called leaf blocks, and these blocks contain:

                              every indexed data value and a corresponding ROWID

                              ___ Sparse Index ___ index stores first value in each block in the sequential file and a pointer to the block
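To make the dense versus sparse distinction in the answers above concrete, here is a minimal sketch over a tiny sorted file. The block layout, key values, and the helper `sparse_lookup` are assumptions made for illustration.

```python
from bisect import bisect_right

# Hypothetical sorted data file split into blocks of three keys each.
blocks = [[10, 20, 30], [40, 50, 60], [70, 80, 90]]

# Dense index: one entry per key -> (block number, offset in block).
dense = {key: (b, o) for b, block in enumerate(blocks) for o, key in enumerate(block)}

# Sparse index: only the first key of each block.
sparse_keys = [block[0] for block in blocks]

def sparse_lookup(key):
    """Find the candidate block via the sparse index, then search inside it."""
    b = bisect_right(sparse_keys, key) - 1
    if b < 0:
        return None
    # The block must be scanned even when the key turns out to be absent.
    return (b, blocks[b].index(key)) if key in blocks[b] else None

print(len(dense), len(sparse_keys))  # 9 3
print(dense[50], sparse_lookup(50))  # (1, 1) (1, 1)
print(sparse_lookup(55))             # None
```

The sparse index holds three entries instead of nine, at the cost of scanning a block on every lookup, including unsuccessful ones.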

                              cyberian wrote (#31):

                              @zareen said in CS614 Quiz No. 2 Solution and Discussion:

                              Q10: Which of the following is NOT one of the variants of Nested-loop join?
                              Binary index nested-loop join.

                              The following are variants of the nested-loop join:

                              1. Basic Nested-Loop Join: A straightforward nested-loop join where for each tuple in the outer relation, the inner relation is scanned entirely.

                              2. Block Nested-Loop Join: Instead of processing one tuple at a time, this method processes a block of tuples from the outer relation, which can reduce the number of disk I/O operations.

                              3. Index Nested-Loop Join: This variant uses an index on the inner relation to quickly find matching tuples, which can significantly speed up the join operation.

                              NOT a variant:

                              1. Hash Join: This is not a variant of the nested-loop join. Instead, it’s a different join algorithm that uses a hash table to partition one or both of the relations before performing the join.

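                              So, if Hash Join appears among the options, it is the correct answer when you're asked which one is not a variant of the nested-loop join. "Binary index nested-loop join", the answer quoted above, is likewise not among the variants listed here.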
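The basic and index variants can be sketched as follows. This is an illustrative sketch only: a Python dict stands in for whatever index exists on the inner table, and the table contents and function names are hypothetical.

```python
def nested_loop_join(outer, inner, key):
    """Basic variant: compare every outer row with every inner row, O(M*N)."""
    return [(o, i) for o in outer for i in inner if o[key] == i[key]]

def index_nested_loop_join(outer, inner, key):
    """Index variant: probe a lookup structure on the inner table."""
    index = {}
    for i in inner:  # a dict stands in for a real index here
        index.setdefault(i[key], []).append(i)
    return [(o, i) for o in outer for i in index.get(o[key], [])]

customers = [{"id": 1, "name": "A"}, {"id": 2, "name": "B"}]
orders = [{"id": 1, "item": "x"}, {"id": 1, "item": "y"}, {"id": 3, "item": "z"}]

a = nested_loop_join(customers, orders, "id")
b = index_nested_loop_join(customers, orders, "id")
print(len(a), a == b)  # 2 True
# Swapping outer and inner changes the work done, not the number of matches.
print(len(nested_loop_join(orders, customers, "id")))  # 2
```

The last line matches the quiz answer that the number of matching rows is independent of the order of the tables.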

                              Discussion is the right way to reach the solution of every assignment, quiz, and GDB.
                              We are always here to discuss and guide; please don't visit Cyberian only for solutions.
                              The Cyberian Team is always happy to help with idea solutions. Please don't hesitate to contact us!
                              [NOTE: Don't copy or replicate idea solutions.]
                              • zareen wrote:

                                1_ As per Bill Inmon, a data warehouse, in contrast with classical applications is:

                                Data driven _ pg285

                                2_ Which of the following is NOT one of the three parallel tracks in Kimball’s approach?
                                Lifecycle Maintenance track

                                3_ Bill Inmon argues that requirements are well understood only after
                                Data warehouse is populated _ pg285

                                4_ Goal driven approach of data warehouse development was result of ______ work
                                Böhnlein and Ulbrich-vom _ pg285

                                5_ Identify the TRUE statement:
                                Clustering is unsupervised learning and classification is supervised learning _ pg 270

                                6_ Normally the term “DWH face to the business user” refers to:
                                Lifecycle Analytical Applications track _ pg 306

                                7_ In ________learning you don’t know the number of clusters and no idea about their attributes.
                                Unsupervised learning
                                https://www.cs.uic.edu/~liub/teach/cs583-fall-05/CS583-unsupervised-learning.ppt

                                8_ Waterfall model is appropriate when
                                Requirements are clearly defined _ pg 284

                                9_ Implementation of a data warehouse requires ________ activities.
                                none of above

                                10_ Normally the input data structure (a database table) for a data mining algorithm:
                                Has more number of records than attributes (not sure)

                                cyberian wrote (#32):

                                @zareen said in CS614 Quiz No. 2 Solution and Discussion:

                                1_ As per Bill Inmon, a data warehouse, in contrast with classical applications is:

                                Data driven _ pg285

                                According to Bill Inmon, a prominent figure in the field of data warehousing, a data warehouse is distinguished from classical applications by several key characteristics:

                                1. Subject-Oriented: Data warehouses are designed around key subjects or business areas (such as sales, finance, or customer information), rather than focusing on individual applications or processes.

                                2. Integrated: Data in a data warehouse is collected from multiple sources and is integrated into a consistent format. This integration allows for unified reporting and analysis across the organization.

                                3. Time-Variant: Data warehouses store historical data, allowing users to analyze trends and changes over time. This time dimension is crucial for tracking business performance and making long-term decisions.

                                4. Non-Volatile: Once data is entered into a data warehouse, it is not typically updated or deleted. This ensures that the data remains stable and can be used for historical analysis without being altered.

                                5. Optimized for Querying and Reporting: Unlike classical applications, which are optimized for transaction processing, data warehouses are optimized for complex queries and analysis. This follows from the four properties above rather than being part of Inmon's own definition.

                                In summary, according to Bill Inmon, a data warehouse is subject-oriented, integrated, time-variant, and non-volatile, and as a consequence it is optimized for querying and reporting rather than for transaction processing and operational tasks. The quoted answer, "Data driven", refers to how the warehouse is developed: as item 3 of the quoted quiz puts it, Inmon argues that requirements are well understood only after the data warehouse is populated.


                                  cyberian wrote (#33):

                                  @zareen said in CS614 Quiz No. 2 Solution and Discussion:

                                  4_ Goal driven approach of data warehouse development was result of ______ work
                                  Böhnlein and Ulbrich-vom _ pg285

                                  The goal-driven approach to data warehouse development is attributed in the quoted answer (pg 285) to Böhnlein and Ulbrich-vom, not to Bill Inmon.

                                  Bill Inmon, often referred to as the "father of data warehousing," is associated with the data-driven approach: as items 1 and 3 of the same quiz put it, a data warehouse is data driven in contrast with classical applications, and requirements are well understood only after the data warehouse is populated. A goal-driven approach instead starts from business goals and derives the data warehouse requirements from them.

                                  In contrast, Ralph Kimball's approach is known for its bottom-up methodology, which builds data marts first and then integrates them into a comprehensive data warehouse. Inmon's approach is top-down, starting from an enterprise-wide warehouse.

                                  Copyright © 2010-26 RUP Technologies LLC. USA | Contributors