Data Mining, Data Warehouse MCQ, with Free Download.
21. Classification task referred to
(A) A subdivision of a set of examples into a number of classes
(B) A measure of the accuracy, of the classification of a concept that is given by a certain theory
(C) The task of assigning a classification to a set of examples
(D) None of these
22. Database is
(A) Large collection of data mostly stored in a computer system
(B) The removal of noise errors and incorrect input from a database
(C) The systematic description of the syntactic structure of a specific database. It describes the structure of the attributes the tables and foreign key relationships.
(D) None of these
23. Data cleaning is
(A) Large collection of data mostly stored in a computer system
(B) The removal of noise errors and incorrect input from a database
(C) The systematic description of the syntactic structure of a specific database. It describes the structure of the attributes the tables and foreign key relationships.
(D) None of these
24. Data dictionary is
(A) Large collection of data mostly stored in a computer system
(B) The removal of noise errors and incorrect input from a database
(C) The systematic description of the syntactic structure of a specific database. It describes the structure of the attributes the tables and foreign key relationships.
(D) None of these
25. Data mining is
(A) The actual discovery phase of a knowledge discovery process
(B) The stage of selecting the right data for a KDD process
(C) A subject-oriented integrated timevariant non-volatile collection of data in support of management
(D) None of these
26. Data selection is
(A) The actual discovery phase of a knowledge discovery process
(B) The stage of selecting the right data for a KDD process
(C) A subject-oriented integrated timevariant non-volatile collection of data in support of management
(D) None of these
27. Data warehouse is
(A) The actual discovery phase of a knowledge discovery process
(B) The stage of selecting the right data for a KDD process
(C) A subject-oriented integrated timevariant non-volatile collection of data in support of management
(D) None of these
28. Coding is
(A) Group of similar objects that differ significantly from other objects
(B) Operations on a database to transform or simplify data in order to prepare it for a machine-learning algorithm
(C) Symbolic representation of facts or ideas from which information can potentially be extracted
(D) None of these
29. DB/2 is
(A) A family of relational database management systems marketed by IBM
(B) Interactive systems that enable decision makers to use databases and models on a computer in order to solve illstructured problems
(C) It consists of nodes and branches starting from a single root node. Each node represents a test, or decision.
(D) None of these
30. Decision support systems (DSS) is
(A) A family of relational database management systems marketed by IBM
(B) Interactive systems that enable decision makers to use databases and models on a computer in order to solve illstructured problems
(C) It consists of nodes and branches starting from a single root node. Each node represents a test, or decision.
(D) None of these
31. Decision trees is
(A) A family of relational database management systems marketed by IBM
(B) Interactive systems that enable decision makers to use databases and models on a computer in order to solve illstructured problems
(C) It consists of nodes and branches starting from a single root node. Each node represents a test, or decision.
(D) None of these
32. Deep knowledge referred to
(A) It is hidden within a database and can only be recovered if one is given certain clues (an example IS encrypted information)
(B) The process of executing implicit previously unknown and potentially useful information from dat(A)
(C) An extremely complex molecule that occurs in human chromosomes and that carries genetic information in the form of genes.
(D) None of these
33. Discovery is
(A) It is hidden within a database and can only be recovered if one is given certain clues (an example IS encrypted information).
(B) The process of executing implicit previously unknown and potentially useful information from dat(A)
(C) An extremely complex molecule that occurs in human chromosomes and that carries genetic information in the form of genes.
(D) None of these
34. DNA (Deoxyribonucleic acid)
(A) It is hidden within a database and can only be recovered if one is given certain clues (an example IS encrypted information).
(B) The process of executing implicit previously unknown and potentially useful information from dat (A)
(C) An extremely complex molecule that occurs in human chromosomes and that carries genetic information in the form of genes.
(D) None of these
35. Enrichment is
(A) A stage of the KDD process in which new data is added to the existing selection
(B) The process of finding a solution for a problem simply by enumerating all possible solutions according to some pre-defined order and then testing them.
(C) The distance between two points as calculated using the Pythagoras theorem.
(D) None of these
36. Enumeration is referred to
(A) A stage of the KDD process in which new data is added to the existing selection.
(B) The process of finding a solution for a problem simply by enumerating all possible solutions according to some pre-defined order and then testing them
(C) The distance between two points as calculated using the Pythagoras theorem.
(D) None of these
37. Euclidean distance measure is
(A) A stage of the KDD process in which new data is added to the existing selection.
(B) The process of finding a solution for a problem simply by enumerating all possible solutions according to some pre-defined order and then testing them.
(C) The distance between two points as calculated using the Pythagoras theorem
(D) None of these
38. Heuristic is
(A) A set of databases from different vendors, possibly using different database paradigms
(B) An approach to a problem that is not guaranteed to work but performs well in most cases.
(C) Information that is hidden in a database and that cannot be recovered by a simple SQL query.
(D) None of these
39. Heterogeneous databases referred to
(A) A set of databases from different vendors, possibly using different database paradigms
(B) An approach to a problem that is not guaranteed to work but performs well in most cases.
(C) Information that is hidden in a database and that cannot be recovered by a simple SQL query.
(D) None of these
40. Hidden knowledge referred to
(A) A set of databases from different vendors, possibly using different database paradigms
(B) An approach to a problem that is not guaranteed to work but performs well in most cases.
(C) Information that is hidden in a database and that cannot be recovered by a simple SQL query.
(D) None of these
21. (C) 22. (A) 23. (B) 24. (C) 25. (A)
26. (B) 27. (C) 28. (B) 29. (A) 30. (B)
31. (C) 32. (A) 33. (B) 34. (C) 35. (A)
36. (B) 37. (C) 38. (B) 39. (A) 40. (C)