Week 1 : Assignment 1 Which answer is the most appropriate description of the concept of Recursion the reduction of the solution to a problem into a sequence of solutions to sub-problems the reduction of the solution to a problem into a parallell solutions to sub-problems the decomposition of a problem into sub-problems followed by the solution of these sub-problems the reduction of the solution to a problem into the the same form of solutions to sub-problems Accepted Answers: the reduction of the solution to a problem into the the same form of solutions to sub-problems Which is the most appropriate description of the concept of Tensor ? A matrix with vectors as components A multidimensional table that is a n-dimensional generalization of scalars, vectors and matrices A matrix with more and two dimensions Accepted Answers: A multidimensional table that is a n-dimensional generalization of scalars, vectors and matrices Which is the most appropriate characterization of a Hamiltonian graph? A graph that can be drawn in a plane without any edges crossed A graph for which exists a closed path that traverses the vertices of the graph exactly once A graph for which exists a closed path that traverses the edges of the graph exactly once Accepted Answers: A graph for which exists a closed path that traverses the vertices of the graph exactly once Which is the most appropriate characterization of the concept of Convolution? The recursive application of several functions on an argument A mathematical operation which when applied to two functions (f and g) produces a third function. An integral where the bounds are functions Accepted Answers: A mathematical operation which when applied to two functions (f and g) produces a third function. Which is the most appropriate characterization of a Multivariate function a function with properties defined by another function a function that assumes two or more values for at least one argument value a function of more than one variable Accepted Answers: a function of more than one variable Which is the most appropriate characterization of the concept of Regression? A technique for estimating relationships between variables A technique for infering causes from effects A technique for reducing compounds onto components Accepted Answers: A technique for estimating relationships between variables Who pioneered work on Computer Vision in 1957? Marvin Minsky Ray Solomonoff Oliver Selfridge John Holland Herbert Simon Frank Rosenblatt Accepted Answers: Frank Rosenblatt Who coined the term Machine Learning in 1959? Marvin Minsky John McCarthy Allen Newell Arthur Samuel Herbert Simon Frank Rosenblatt Accepted Answers: Arthur Samuel To which category of search strategies does Depth First Search belong? Brute-Force Search Strategies Informed (Heuristic) Search Strategies Local Search Algorithms Accepted Answers: Brute-Force Search Strategies In knowledge representation, the collections of definitions of basic entities and their relationships for a specific domain is referred to by a specific term. Which term? Epistemology Ontology Phenomenology Accepted Answers: Ontology The ´Logic Theorist´ developed in 1955 has been called ´The first Artificial Intelligence Program´. Which of the early AI scientist was one of the co-developers. Marvin Minsky John McCarthy John Holland Allen Newell Accepted Answers: Allen Newell On which calculus did John McCarthy base the LISP programming language? FUNCTIONAL CALCULUS DIFFERENTIAL CALCULUS LAMBDA CALCULUS Accepted Answers: LAMBDA CALCULUS What is MINIMAX? A Decision Rule that maximizes the potential win in a best case scenario A Decision Rule that minimizes the potential loss in a worst case scenario A Decision Rule that optimizes the balance potential loss and potential win in worst and best cases Accepted Answers: A Decision Rule that minimizes the potential loss in a worst case scenario What do you call a function that estimates the cost of a path from the current node in a search graph to the closest goal node? MINIMALISTIC OPPORTUNISTIC HEURISTIC Accepted Answers: HEURISTIC Week 2 : Assignment 2 What does the term Feature Extraction refer to? Deriving a feature from an analogue input like an image Making a selection of a subset of features from a featureset Creating a new set of features by transformation of a featureset Accepted Answers: Creating a new set of features by transformation of a featureset ´Which important class of machine learning algorithms X is defined as follows? X is concerned with how agents take actions in an environment and changing state so as to maximize some notion of cumulative reward. Explanation-based learning Inductive logic programming Deep learning Similarity-based learning Reinforcement learning Accepted Answers: Reinforcement learning What is meant by a NEAR MISS in Machine Learning A misclassification of an example in Conceptual Clustering A non optimal chosen hypotheses in supervised learning due to a non optimal selction and ordering of examples An erroneous action in Reinforcement learning A deliberately choosen negative examples close to positive examples Accepted Answers: A deliberately choosen negative examples close to positive examples Choose the most adequate pair of characteristics of a feature that forms the basis for removing this feature? Irrelevance and High Feature value Cardinality High Feature value Cardinality and Redundancy Irrelevance and Redundancy High Feature value cardinality and Non-informativeness Accepted Answers: Irrelevance and Redundancy Learning Bias is one of the core concepts in the classical frameworks for Inductive Learning. Which of the following interpretations is the most adequate for capturing that concept? Deliberately using a very simplified and restricted Hypothesis Language Deliberately choosing particular subsets of examples as the basis for learning Embed background knowledge in the definition of the hypothesis language. Accepted Answers: Embed background knowledge in the definition of the hypothesis language. What is the term best suited for the situation depicted in the figure below, where the green line estimates the classification of the instances? Linear separability Overfitting Linear non-separability Underfitting Accepted Answers: Overfitting A taxonomy is a directed acyclic graph (DAG) under the Specialization relation. Which of the following Structures is NOT a valid DAG Accepted Answers: A taxonomy is a category structure, typically a hierachical one. Normally features are attributed to all categories on the different levels of the taxonomy. In this context, what is meant by INHERITANCE? That a leaf (terminal) category is characterized by the aggregation of attributes of itself and all its ancestor categories. That a category is characterized by the same attributes as its parent category only. That a category is characterized by the same attributes as its sibling categories. Accepted Answers: That a leaf (terminal) category is characterized by the aggregation of attributes of itself and all its ancestor categories. For everyday entities such as tools, clothes, furniture, fruits, pets etc, a typical taxonomy (category structure) has three levels of categories : superordinate, basic and subordinate. What is the primary characteristic of the subordinate level. It is the most abstract level with the fewest features It is the level with the highest density of attributes and the most common sense and everyday character of features It is the most concrete level with detailed features Accepted Answers: It is the most concrete level with detailed features For everyday entities such as tools, clothes, furniture, fruits, pets etc, a typical taxonomy has three levels: superordinate, basic and subordinate. Which of the following Categories in a fruit taxonomy would normally be characterized as SUPERORDINATE? Fruit Apple Macintosh Accepted Answers: Fruit Which properties does a Penguin have according to the above taxonomy with a standard specialization relation? Cannot Fly Cannot Fly, Has a beak, Can control body temperature in relation to environment, Has a spinal cord, Can move Cannot Fly, Has a beak, Can control body temperature in relation to environment, Has a spinal cord Cannot Fly, Has a beak, Can control body temperature in relation to environment Cannot Fly, Has a beak Accepted Answers: Cannot Fly, Has a beak, Can control body temperature in relation to environment, Has a spinal cord, Can move Week 3 : Assignment 3 How are the 14 dataitems in the provided dataset distributed on the leafs of the decision tree above? a=2 b=3 c =4 d=2 e=3 a=3 b=2 c =4 d=3 e=2 a=3 b=2 c =3 d=2 e=2 Accepted Answers: a=3 b=2 c =4 d=3 e=2 What is the normal notation for the probability of the occurence of one event A given another event B ? P(B) AND P(A) P(B,A) P(A)<-P(B) P(A|B) Accepted Answers: P(A|B) Which is the correct description of the following Bayesian Network in terms of basic subnetworks? Sequence, Convergence, Divergence Sequence, Divergence, Convergence Divergence, Sequence, Convergence Divergence, Convergence, Sequence Convergence, Sequence, Divergence Convergence, Divergence, Sequence Accepted Answers: Convergence, Sequence, Divergence A version of bayes rule: P(cause|effect) = P(effect|cause)* P(cause)/P(effect) In this example effect = person has a fever cause = person has a flu Prior probabilities: P(cause) = 0.0001, P(effect)= 0.002 P(effect|cause) = 0.8 What is the value of P(cause|effect) 0.08 0.016 0.04 Accepted Answers: 0.04 Which of the following pairs of steps is not consistent with the normal sequence of steps in Genetic Algorithms? Generation, Selection, Selection, Evaluation Evaluation, Mutation Evaluation, Reproduction Reproduction, Mutation Accepted Answers: Evaluation, Mutation Two chromosomes A B 01110000 10001111 A two point crossover operation in-between positions 1 and 8 should be performed to give two new chromosomes. Which constellation is the crossover result? 10001111 01110000 00001110 11110001 11110001 00001110 Accepted Answers: 00001110 11110001 What is meant by The Bucket Brigade Phase of a Classifier system based on Genetic Algorithms? The Bucket Brigade phase involves an algorithm for handover of results forwards along a sequence of rule applications. The Bucket Brigade phase involves an algorithm for handover of results forwards between two steps in a sequence of rule applications. The Bucket Brigade phase involves an algorithm for attributing credit and blame backwards along a sequence of rule applications. The Bucket Brigade phase involves an algorithm for attributing credit and blame backwards between two steps in a sequence of rule applications. Accepted Answers: The Bucket Brigade phase involves an algorithm for attributing credit and blame backwards along a sequence of rule applications. What is the term for the points of connectivity between neurons? AXON DENDRITE SOMA SYNAPSE Accepted Answers: SYNAPSE Which is the best estimate of the number of neurons in the human brain? 1 Billion 10 Billions 100 Billions 1000 Billions Accepted Answers: 100 Billions What is the output value Y for the above artificial neuron? 0.9 0.97 0.72 Accepted Answers: 0.72 Week 4 : Assignment 4 The different data-driven search approaches to induction require different storage policies. What does the Breadth First Search algorithm have to store? All positive examples + All negative examples + All hypotheses All positive examples + all hypotheses All negative examples + all hypotheses All hypotheses Accepted Answers: All negative examples + all hypotheses An inductive system learns from unordered pairs of dataitems of the form (size, color, animal) where size= (large or small), color=(black or brown) and animal=(cat or dog). How does the set of hypotheses look like after having handled the following three instances: Common Instructions for Q2-Q4 Instance 1. { (Large Black Dog) (Small Brown Cat) } Instance 2. { (Large Brown Cat) (Small Black Dog)} Instance 3. { (Large Brown Dog) (Small Brown Dog) } positive positive negative Using Depth First Search CBH3 ={ (Large ? ?) (Small ? Cat) } CBH3={ (Large ? Dog) (Small ? ? ) } CBH3=(? Black Dog) (? Brown Cat)} Accepted Answers: CBH3=(? Black Dog) (? Brown Cat)} Using Breadth First Search S3 = { (? Black Dog ) (? Brown Cat) } S3 ={ (Large ? ?) (Small ? Cat) } S3={ (Large ? ?) (Small ? ? ) } Accepted Answers: S3 = { (? Black Dog ) (? Brown Cat) } Using the Version space approach S3 ={(Large ? ?) (Small ? Cat) } G3 = { (? Black ?) (? ? Cat) } { (? ? Cat) (? Brown ?) } S3 ={ (? Black Dog) (? Brown Cat) } G3 = { (? Black ?) (? ? ?) } { (? ? Cat) (? ? ?) } S3 ={S3={ (Large ? ?) (Small ? ? ) } G3 = { (? Black ?) (? Brown ?) } { (? ? Cat) (? ? Dog) } Accepted Answers: S3 ={ (? Black Dog) (? Brown Cat) } G3 = { (? Black ?) (? ? ?) } { (? ? Cat) (? ? ?) } How does the set of hypotheses look like after having handled the following three instances: Common Instructions for Q5-Q7 Instance 1. { (Large Black Dog) (Small Brown Cat) } Instance 2. { (Large Brown Cat) (Small Black Dog)} Instance 3. { (Large Brown Dog) (Small Brown Dog) } Using Depth First Search CBH3 ={ (Large ? ?) (Small ? Cat) } CBH3={ (Large ? Dog) (Small ? ? ) } CBH3=(? Black Dog) (? Brown Cat)} Accepted Answers: CBH3=(? Black Dog) (? Brown Cat)} positive positive negative Using Breadth First Se S3 ={ (Large ? ?) (Small ? Cat) } S3 = { (? Black Dog ) (? Brown Cat) } S3={ (Large ? ?) (Small ? ? ) } Accepted Answers: S3 = { (? Black Dog ) (? Brown Cat) } Using the Version space approach S3 ={(Large ? ?) (Small ? Cat) } G3 = { (? Black ?) (? ? Cat) } { (? ? Cat) (? Brown ?) } S3 ={ (Large ? ?) (Small ? ? ) } G3 = { (? Black ?) (? Brown ?) } { (? ? Cat) (? ? Dog) } S3 ={ (? Black Dog) (? Brown Cat) } G3 = { (? Black ?) (? ? ?) } { (? ? Cat) (? ? ?) } Accepted Answers: S3 ={ (? Black Dog) (? Brown Cat) } G3 = { (? Black ?) (? ? ?) } { (? ? Cat) (? ? ?) } Which of the following statements is TRUE for a decision tree? An attribute with lower information gain should be preferred to other attributes. The entropy of a node typically decreases as we go down a decision tree. A Decision tree is an example of a linear classifier. The Entropy of a set increases with its purity. Accepted Answers: The entropy of a node typically decreases as we go down a decision tree. What is the entropy for a decision tree data-set with 6 positive and 4 negative examples. 0.58 0.840 0.41 0.97 Accepted Answers: 0.97 What is the value of the Information Gain in the following partitioning?: 0.26 0.38 0.42 0.18 Accepted Answers: 0.38 For which values of k is the query instance classified as RED by the k-nearest neighbour algorithm? 3 and 7 1 and 3 3 and 5 1 and 7 Accepted Answers: 3 and 7 For the two feature vectors (1,0,1,0,0,0,1) and (0,0,1,1,1, 1,1). What is the Manhattan distance between the two vectors? 5 3 2 4 Accepted Answers: 4 Consider the following two vectors: X = [4, 0, 0, 5, 1, 0, 0] Y = [5, 5, 4, 0, 0, 0, 0] What is the cosine similarity between X and Y?: 0.53 0.43 0.17 0.38 Accepted Answers: 0.38 Consider a two-dimensional non linearly separable input space and the following feature mapping onto a linearly separable three dimensional space: X: = F : ( x1, x2 ) -> (x1^2, x2^2, sqrt 2*x1*x2) Z: = F : ( z1, z2 ) -> (z1^2, z2^2, sqrt 2*z1*z2) K (X,Z)= the Kernel or Similarity function in the three dimensional space = the inner product of X and Z in that space : |X,Z| Calculate K(X,Z) = the inner product of X and Z above for the two two-dimensional vectors of (3,2) and (2,3) in the input space 6 144 124 12 Accepted Answers: 144 Partitioning–based clustering is one of the approaches to Cluster Analysis. Which of the following specific techniques falls in this category? CLIQUE COBWEB K-means Chameleon DBSCAN Accepted Answers: K-means In hierarchical-based clustering, it is typical to work with Distance Matrices between data items and so called Proximity Matrices between clusters. A distance Matrix for 6 data items is given below. We assume three clusters: A-B, C-D and E-F. How does the corresponding Proximity Matrix between these clusters look like? Accepted Answers: Week 5 : Assignment 5 Explanation Based Learning typically takes a predefined Domain Theory and modifies it in such a way that new more complex rules are created as specific combinations of several existing rules. The goal is to make problemsolving more efficient and the rule formation process is guided by the problem examples considered for training. As part of the generalization process for the newly constructed rules a specific kind of pattern matching technique is used. Which is the established term for this technique? Variable matching Resolution Recursion Unification Accepted Answers: Unification Domain Theory: Safe to travel(P,A,C) :- Healthy(P), Error free(A), No epidemics (C). Error free(A):- Passed an inspection (A). Errorfree(A):- Worked when driven same day (A). Healthy(P) :- Had no symptoms for three weeks (P). Healthy(P) :- Passed a health test(P) Passed a health test (X):- On line test (X). Passed a health test (X):- Clinic test (X). Passed an inspection (X):- Company inspection(X). Passed an inspection(X):- Authority inspection(X). No epidemics (C):-No WHO alert (C). Operational predicates: Worked when driven last same day, Had no symptoms for three weeks, On line test, Clinic test, Company inspection, Authority inspection, No WHO alert Training Example: Safe to travel (John, IndiaAir2020, India) Passed a health test(John), Authority inspection(IndiaAir 2020), No WHO alert (India). Goal concept: Safe to travel(P, A, C) Explain and generalize from Safe To Travel( John, IndiaAir2020, India). How does the resulting rule look like after applying an EBL algorithm Safe to travel(John,IndiaAir2020,India)) :- Healthy(John), Error free(IndiaAir2020), No epidemics (India). Safe to travel(John, IndiaAir2020, India):-Passed a health test (John), Authority inspection(IndiaAir 2020), No WHO alert (India). Safe to travel(P, A, C):-Passed a health test (P), Authority inspection(A), No WHO alert (C). Safe to travel(P,A,C)) :- Healthy(P), Error free(A), No epidemics (C). Accepted Answers: Safe to travel(P, A, C):-Passed a health test (P), Authority inspection(A), No WHO alert (C). Which of the following inference schemes correspond to Induction? A B C Accepted Answers: B ILP algorithm constructs a lattice of clauses fabricated from the building blocks (predicates, variables, constants) specific to each induction case and navigates that lattice in order to establish a Hypothesis set that fits the current set of examples. ILP can work in two fashions: Top Down and Bottom Up, establishing specialization or generalization relations in the lattice. Case Hypotheses: insect(X). Positive Examples: insect(silverfish), insect(green drake), insect(cricket) Background knowledge: Has six legs (fire brat), Has six legs (green drake), Have wings(green drake), Has six legs (cricket), Have wings(cricket), Can fold wings(cricket), How does the Hypotheses set of clauses look like after a first iteration of a Top Down ILP algorithm based on the first positive example insect(fire brat) ? insect(X):-Has six legs(X). insect(X):-Has six legs(X), insect(X):-Has wings(X), insect (X):-Can fold wings(X). insect(fire brat):-Has six legs(fire brat). Accepted Answers: insect(X):-Has six legs(X), insect(X):-Has wings(X), insect (X):-Can fold wings(X). Instance Based Learning techniques are typically used in one of the main phases of Case Based Reasoning. Which? RETRIEVE REUSE RETAIN REVISE Accepted Answers: RETRIEVE Choose the correct sequence of phases: X Y Z W = RETRIEVE REUSE RETAIN REVISE X Y Z W = RETRIEVE REVISE REUSE RETAIN X Y Z W = RETAIN REVISE REUSE RETRIEVE X Y Z W = RETAIN RETRIEVE REVISE REUSE Accepted Answers: X Y Z W = RETAIN RETRIEVE REVISE REUSE A dynamic programming approach to calculating the shortest distance from S to T gives the result= 14. Which result would a greedy forward search give? 26 21 14 Accepted Answers: 26 Monte Carlo Reinforcement Learning MC methods learn directly from samples of complete episodes of experience. • • First-visit MC: average returns only for first time s is visited in an episode. Every-Visit MC: average returns for every time s is visited in an episode. Simplified Algorithm for first-visit Monte Carlo 1. Initialize state-value functions. Return list(s) ← empty list 2. For all s in all episodes E 3. return = sum of rewards r in episode from state to termination. 4. if this is the first occurrence of this state s add the calculated return to the returns list (s). 5. Calculate values of all s as average over all return lists (s). Example: An undiscounted Markov Reward Process with two states A and B. The Transition matrix and reward function are unknown. Two sample episodes: E1 A->A, r=2 A->B, r=7 B->A, r= -7 A->B, r=7 B->terminate, r= -2. E2 B->A, r= -7 A->B, r=7 B->terminate, r = -2 Which are the estimated values of A and B after considering the two episodes using the simplified first-visit algorithm? V(A)= 6,V(B)= -2 V(A)= 7,V(B)= -2 V(A)= 6,V(B)= 2 V(A)= 7,V(B)= 2 Accepted Answers: V(A)= 6,V(B)= -2 In this example, we have a 4X5 board. The start state is marked above with a blue dot. The rewards for all positions are also marked (rewards being 1, -4 and 4). The positions on the rim of the board are terminal states. The board elements are labelled from 1 to 20 as in the mid figure. Moves are N, S, E and W. We consider two episodes: 1. 12:E->13:E->14:E->15 and 2. 12:N->7:E->8:E->9:E->10. What are the updated versions of the Q elements for state 12, 13, 14, 7,8 and 9 after considering these two episodes? α=γ=1α=γ=1. 12,E=0 13,E=0, 14,E= -4 12,N=0, 7,E=0 8,E=0 9,E = 4 12,E=1 13,E=1 14,E= -6 12,N=1 7,E=1 8,E=1 9,E= 6 12,E=1 13,E=2 14,E= -6 12,N=1 7,E=2 8,E=3 9,E= 6 12,E=1 13,E=2 14,E= -2 12,N=1, 7,E=2 8,E=3 9,E=7 Accepted Answers: 12,E=1 13,E=1 14,E= -6 12,N=1 7,E=1 8,E=1 9,E= 6 Week 6 : Assignment 6 One of the logical operators described by the following truthtables is infamous for being not linearly separable and therefore not possible to implement by a single perceptron. Which? Accepted Answers: How will the final weight vector look like when all data-items are processed 1011 0 0 -1 0 0 -1 0 0 1010 None of the above Accepted Answers: 0 -1 0 0 In a feedforward ANN, the so called Delta Rule is based on a particular error measure based on the target values and output values for the outputs of the ANN. When this particular error measure is applied for a single neuron, what is the error measure for a target value of 12 and an output value of 14. E= ½ * ( T-Y) ^2 4 2 0.5 16 Accepted Answers: 2 How will the final weight vector look like when all data-items are processed? -0.25 0 0 0.25 0 -0.25 0.25 0 0.5 0 0.75 -0.35 -0.25 0.5 -0.25 0.25 None of the above Accepted Answers: -0.25 0.5 -0.25 0.25 What is the output value Y from neuron # 6 ? 0.118 0.018 1.118 1.018 Accepted Answers: 0.018 What is the backpropagated error sensitivity at neuron # 1 ? 0.0144 0.040 0.004 0.014 Accepted Answers: 0.004 What is the updated weight for connection between input x1 and neuron #1? 0.104 0.140 0.1144 0.114 Accepted Answers: 0.104 Select the main reason for extending standard RNNs to so called Bidiectional RNNs: being able to handle multiple levels of abstractions within the domains of analysis being able to get information from past (backwards) and future (forward) states simultaneously Mastering the vanishing gardient problem Mastering the long dependency problem Accepted Answers: being able to get information from past (backwards) and future (forward) states simultaneously Which RNN network structure would best fit a text analysis task, where the occurences of references to a specific kind of event are searched for? Accepted Answers: Hebb’s Law can be represented in the form of two rules: 1. 2. If two neurons on either side of a connection (synapse) are activated synchronously, then the weight of that connection is increased. If two neurons on either side of a connection (synapse) are activated asynchronously, then the weight of that connection is decreased. Learning according to Hebb’s Law is primarily consistent with one of the following kinds of learning Reinforcement learning Un-supervised learning. Supervised learning Accepted Answers: Un-supervised learning. How will the final weight vector look like when the training data has been processed? 3223 2233 3322 3232 None of the above Accepted Answers: 3322 Hopfield Networks use a method borrowed from Mettalurgy to let the network states settle. The method can be characterized as of below: • • • Heat the solid state metal to a high temperature Cool it down very slowly according to a specific schedule. If the heating temperature is sufficiently high to ensure random state and the cooling process is slow enough to ensure thermal equilibrium, then the atoms will place themselves in a pattern that corresponds to the global energy minimum of a perfect crystal. What is the name of the method Casting Crystallization Annealing Welding Forging Accepted Answers: Annealing What is the resulting state vector ? 1 -1 1 1 1 -1 -1 1 1 111 Accepted Answers: 111 how does the feature map look like after the convolution of input array A with the filter F? Accepted Answers: Week 7 : Assignment 7 A movement within Psychology has also influenced re-inforcement Learning. Which? Behaviourism Cognitivism Reductionism Structuralism Functionalism Accepted Answers: Behaviourism A movement within Psychology is consistent with an abstract model of the Mind in terms of systems of Symbols, Concepts and Logically related inferences. Which movement? Reductionism Behaviourism Structuralism Cognitivism Functionalism Accepted Answers: Cognitivism A movement within Psychology is consistent with the motivations behind the sub-symbolic approach of artificial neural networks. Which movement? Cognitivism Behaviourism Structuralism Reductionism Functionalism Accepted Answers: Reductionism Learning of Language and language engineering in CS and AI is strongly inspired by the structuralist view on Linguistics. Who was the front figure for this view? Bloomfield Hockett Lakoff Chomsky Accepted Answers: Chomsky What is the neuro science term for the property of a system to adapt, reconfigure and consolidate its structure? Locality Plasticity Equipotentiality Mass action Specificity Holism Accepted Answers: Plasticity The work by Donald Hebb exemplifies an integration of two of the principles below. Which two? Locality, Specificity Equipotentiality, Mass Action Locality, Holism Mass action, Plasticity Resilience, Plasticity Resilience, Equipotentiality Accepted Answers: Locality, Holism The Classical view of categories is characterized by the following statements. One statement is wrong. Which? Categories are arbitrary. Typically the ways we choose to categorize objects are culturally and linguistically based The levels in a hierarchy (lattice) of categories have a different status , with the middle layer being most fundamental to everyday thinking (basic level) in contrast to more abstract (superordinate) and more detailed (subordinate) levels Categories have defining attributes (features/value combinations). All members share them, No nonmembers share them and there are no overlap between members and nonmembers The Intension (set of attributes) determines the extension of a category (its members) The member space has no internal structure and all members are regarded as equal and first class citizens. Accepted Answers: The levels in a hierarchy (lattice) of categories have a different status , with the middle layer being most fundamental to everyday thinking (basic level) in contrast to more abstract (superordinate) and more detailed (subordinate) levels The Modern or natural view of a categories is characterized by the following statements. One statement is wrong. Which? Categories are in many case motivated by properties of our sensory system and the world surrounding us. Categories are built around central members or prototypes defined by sharing more attributes with other members than with nonmembers. Members are graded based on typicality Obviously typicality generates a topology of the member space The Intension (set of attributes) crisply determines the extension of a category (its members) The levels in a hierachy (lattice) of categories have a different status , with the middle layer being most fundamental to everyday thinking (basic level) in contrast to more abstract (superordinate) and more detailed (subordinate) levels Accepted Answers: The Intension (set of attributes) crisply determines the extension of a category (its members) The Entropy measure is used to guide the optimal setup of a decision tree. The measure is inspired by entropy in physics which measures the: Molecular regularity Molecular disorder Molecular non-regularity Molecular order Accepted Answers: Molecular disorder The Energy function in Hopfield Networks is inspired by a model of ferromagnetism in Statistical Mechanics. The model is named after a person. Which? Lenz Hopfield Ising Onsager Accepted Answers: Ising Which Dataset Repository is primarily a gate-way to other specific repositories for Data-sets Imagenet Kaggle Repository Google trends Amazon reviews Accepted Answers: Kaggle Repository Deep Mind is a company focussing on Artificial Intelligence being related to another large company. Which? Amazon Microsoft SAP Google Facebook Huawei Accepted Answers: Google What does the abbreviation API stands for? Advanced Programming Interface Application Programming Interface Advanced Project Integrator Accepted Answers: Application Programming Interface Which of the following tools are NOT Open Source? Watson TensorFlow Python Wolfram Caffe Accepted Answers: Wolfram Which of these systems for technical computations is primarily aimed for numerical calculations in contrast to symbolical? Mathematica Macsyma MATLAB Maple Reduce Accepted Answers: MATLAB Which of the programming languages below can be characterized as a ´Multi Paradigm Language´ in the sense that it supports both imperative, functional, object oriented programming and indirectly logic oriented programming? Pascal C C++ C# LISP Haskel Prolog Python Wolfram R Java Java Script Visual Basic Accepted Answers: Python One popular tool in a currently typical machine learning toolbox, provides collaborative environments, where you can freely combine human-readable narratives with computer-readable codes, equations and visualizations. Pick this tool from the list below: Caffe Pytorch Theano Jupyter Notebooks Tensorflow Microsoft Cognitive Toolkit Accepted Answers: Jupyter Notebooks Many Machine Learning tools are specialized for support of ANN, in particular Deep Learning. Examples are given below. One of the listed tools have an ambition for support for Machine Learning techniques in general. Which one? Intel deep learning cloud Microsoft Cognitive Toolkit Tensorflow Caffe Accepted Answers: Tensorflow Which two of the following platforms are primarily NOT aimed for Cloud/Distributed Computing? Amazon Webservices Apple iCloud SAP Leonardo Microsoft Azur Hadoop MapReduce Apache SPARK Accepted Answers: SAP Leonardo The dominating hardware providers are launching Systems on a Chip (SoCs) that can provide the basic functionalities for Advanced Driver Assist Systems (ADAS) on various levels (2-5). The core of the functionality of the SoCs is the image recognition enabled by Machine Learning (Deep Learning) Which is the correct coupling between the hardware providers and ADAS/SoCs? Qualcomm - Drive Xavier/Pegasus Nvidia - Snapdragon Ride Qualcomm - Drive Xavier/Pegasus Nvidia - Mobileeye Qualcomm - Snapdragon Ride Nvidia - Drive Xavier/Pegasus Intel - Mobileye Intel - Snapdragon Ride Qualcomm - Snapdragon Ride Xavier/Pegasus Nvidia - Mobileye Qualcomm - Mobileye Xavier/Pegasus Nvidia - Snapdragon Ride Qualcomm - Mobileye Nvidia - Drive Xavier/Pegasus Intel - Mobileye Intel - Drive Intel - Drive Intel - Snapdragon Ride Accepted Answers: Qualcomm - Snapdragon Ride Nvidia - Drive Xavier/Pegasus Intel - Mobileye