Birla Institute of Technology & Science, Pilani
Work Integrated Learning Programmes Division
Second Semester 2019-20
M.Tech. (Data Science and Engineering)
Comprehensive Examination (Regular)
Course No. : DSECLZG565
Course Title : MACHINE LEARNING
Nature of Exam : Open Book
Weightage : 40%
Duration : 2 Hours
Date of Exam: July 12, 2020 Time of Exam: 10:00 AM – 12:00 PM
Note: Assumptions made if any, should be stated clearly at the beginning of your answer.
Question 1. [3+3+2+3=11 marks]
Suppose you flip a coin with unknown bias θ, where P(x = H | θ) = θ, five times and observe the outcome HHHHH.
What is the maximum likelihood estimator for θ? [1 mark]
Would you think this is a good estimator? If not, why not? [2 marks]
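For reference, a minimal sketch of the estimate asked for above, assuming the five flips are independent Bernoulli trials so that the likelihood is θ^h (1-θ)^(n-h) and the maximiser is the fraction of heads. The function name bernoulli_mle is hypothetical and used only for illustration.

```python
def bernoulli_mle(outcomes: str) -> float:
    """Maximum likelihood estimate of theta = P(H) for i.i.d. coin flips.

    The likelihood theta^h * (1 - theta)^(n - h) is maximised at h / n,
    i.e. the observed fraction of heads.
    """
    if not outcomes:
        raise ValueError("need at least one flip")
    heads = outcomes.count("H")
    return heads / len(outcomes)


# The sequence observed in the question.
theta_hat = bernoulli_mle("HHHHH")
print(theta_hat)
```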
A disease has four symptoms, and a physician's past records give the following data. Use a Naïve Bayes classifier to predict whether a new patient with the given symptoms has the disease. [2 marks]
Can logistic regression be applied to a multi-class classification problem?
State true or false [1 mark]
Why are log probabilities computed instead of probabilities? [1 mark]
To make computation consistent
To factor into smaller values of probabilities
To factor into larger values of probabilities
None of these
1. In a linear relationship y = m*x+b, y is said to be dependent on x when: [1 mark]
m is closer to zero.
m is far from zero.
b is far from zero.
b is closer to zero.
2. In a linear relationship between y and x, y is not dependent on x when: [1 mark]
The coefficient is closer to zero.
The coefficient is far from zero.
The intercept is far from zero.
The intercept is closer to zero.
3. In a linear regression model y = w0 + w1*x, if the true relationship between y and x is y = 7.5 + 3.2x, then w0 acts as: [1 mark]
Intercepts
Coefficients
Estimators
Residuals
Question 2.
The following backpropagation network uses an activation function called leaky ReLU, which generates output = input if input >= 0, and output = 0.1 * input if input < 0. At a particular iteration, the weights are as indicated in the following figure. The training error is given by E = 0.5*(t-y)^2, where t is the target output and y is the actual output from the network. What are the outputs of the hidden nodes and the actual final output y from the network with x1=x2=1? What will be the weights w31 and w12 in the next iteration with learning rate = 0.1, x1=x2=1, and target output t=0? Assume the derivative of the activation function is 0 at input = 0, and zero bias at all nodes. [1+1+1+1.5+2.5=7 marks]
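The activation and error function described in this question can be sketched as follows. This is only an illustration of the definitions, assuming the 0.1 slope applies to negative inputs (the standard leaky ReLU) and using the stated convention that the derivative is 0 at input = 0. The network weights come from the figure, which is not reproduced here, so no forward pass is attempted. All function names are hypothetical.

```python
def leaky_relu(z: float, slope: float = 0.1) -> float:
    """Leaky ReLU: identity for z >= 0, slope * z for z < 0."""
    return z if z >= 0 else slope * z


def leaky_relu_grad(z: float, slope: float = 0.1) -> float:
    """Derivative of leaky ReLU, taken as 0 at z == 0 as the question states."""
    if z > 0:
        return 1.0
    if z < 0:
        return slope
    return 0.0


def squared_error(t: float, y: float) -> float:
    """Training error E = 0.5 * (t - y)^2."""
    return 0.5 * (t - y) ** 2


def squared_error_grad(t: float, y: float) -> float:
    """dE/dy = -(t - y) = y - t, the starting point of the backward pass."""
    return y - t
```

A gradient-descent update for any weight w then has the form w_new = w - learning_rate * dE/dw, where dE/dw is obtained by chaining squared_error_grad with leaky_relu_grad along the path from w to the output.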
Question 3.
Consider training a boosting classifier using decision stumps on the following data set:
1. Circle the examples whose weights will be increased at the end of the first iteration. [2 marks]
2. How many iterations will it take to achieve zero training error? Explain. [3 marks]
A new mobile phone service chain store would like to open 20 service centres in Bangalore. Each service centre should cover at least one shopping centre and 5,000 households with annual income over 75,000. Design a scalable algorithm that decides the locations of the service centres while taking all the aforementioned constraints into consideration. [5 marks]
Question 4.
In a clinical trial, the height and weight of patients are recorded as shown in the table below. For an incoming patient with weight = 58 kg and height = 180 cm, classify the patient as Under-weight or Normal using the KNN algorithm with K = 3. [5 marks]
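The KNN procedure named in this question can be sketched as below. This is a minimal illustration assuming Euclidean distance on the raw (weight, height) values and a simple majority vote. The patient table is not reproduced here, so the sketch defines only the classifier; the name knn_classify and the row layout are hypothetical.

```python
from collections import Counter
from math import dist


def knn_classify(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training rows.

    train: list of ((weight_kg, height_cm), label) pairs (assumed layout).
    query: (weight_kg, height_cm) tuple for the incoming patient.
    """
    # Sort training rows by Euclidean distance to the query, keep the k closest.
    nearest = sorted(train, key=lambda row: dist(row[0], query))[:k]
    # Count the labels of those neighbours and return the most frequent one.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

With the rows of the table loaded into train, the incoming patient would be classified by calling knn_classify(train, (58, 180), k=3).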
Question 5.
Consider the following data, where x1 and x2 are the features:
Positive Points: {(3, 1), (5, 2), (1, 1), (2, 2), (6, -1)}
Negative Points: {(-3, 1), (-2, 2), (0, 3), (-3, 4), (-1, 5)}
Derive the equation of the separating hyperplane and compute the model parameters. [7 marks]
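A derived hyperplane w1*x1 + w2*x2 + b = 0 can be sanity-checked against the given points with a short sketch like the one below. It only tests whether a candidate separates the two classes and reports its geometric margin; it does not perform the derivation. The candidate (w, b) shown is a hand-picked illustration, and the helper names are hypothetical.

```python
from math import hypot

# Data points exactly as given in the question.
POSITIVE = [(3, 1), (5, 2), (1, 1), (2, 2), (6, -1)]
NEGATIVE = [(-3, 1), (-2, 2), (0, 3), (-3, 4), (-1, 5)]


def decision(w, b, x):
    """Signed score w . x + b for a 2-D point x."""
    return w[0] * x[0] + w[1] * x[1] + b


def separates(w, b):
    """True if every positive point scores > 0 and every negative point < 0."""
    return (all(decision(w, b, x) > 0 for x in POSITIVE)
            and all(decision(w, b, x) < 0 for x in NEGATIVE))


def geometric_margin(w, b):
    """Smallest distance from any data point to the hyperplane."""
    norm = hypot(w[0], w[1])
    return min(abs(decision(w, b, x)) for x in POSITIVE + NEGATIVE) / norm


# Illustrative candidate only (an assumption, not part of the question).
w_candidate, b_candidate = (1.0, -1.0), 1.5
print(separates(w_candidate, b_candidate), geometric_margin(w_candidate, b_candidate))
```

Comparing geometric_margin across candidates is one way to check whether a derived solution is plausibly the widest-margin separator.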