Data Fundamentals (H)

1. An array has shape (13, 5). What would we call it?

2. Vectorised operations:

3. Arrays have fixed:

4. An (8,4) array has:

5. x[::2, 1:5] is a slice which indexes:

6. np.arange(1,5) produces the array:

7. x is (15,4). y is (15,). How would I add the values in y to each row of x?

8. x is shape (3,3). What does the operation x[:, [1,0,2]] do?

9. A strided array data structure means that:

10. Not a number (NaN) is represented in IEEE 754 as:

11. Which of these is not a IEEE 754 floating point exception?

12. x is (10,4) and y is (4,4). Which of these operations results in x and y joined into a (14,4) array?

13. A line is a:

14. Faceted and layered

15.

Which of these things is missing from this plot?

[Figure: The heart mass of cats, plotted against the cats' body weight]

16.

I want to compute

\( z = sum_i i^2 x_i^3 y_i \)

\(x\) and \(y\) are 1D vectors of the same shape. z will be a scalar. Which of these does that?

17. x is shape (100,5,5,8). What does np.einsum('ijkl->likj', x) result in?

18. The memory layout of numerical arrays is stored using:

19. \(\|\vec{x}\|_\infty\) could be computed by which operation?

20. Which of these operations is not defined over vectors in a topological vector space equipped with an inner product?

21. \(\vec{x}\bullet \vec{y}\) is zero if and only if:

22. Distances in high-dimension can be counter intuitive because:

23. The covariance matrix represents:

24. When rendering a graph with unsigned scalar values mapped to colours, what property should the colour map have?

25. Applying the linear map defined by the matrix \(A\) to the column vector \(\vec{x}\) should be written:

26. Repeatedly applying a matrix \(A\) to a random initial vector \(\vec{x}_0\), normalising after each step, will lead to:

27. If \(A\) is orthogonal, then:

28. The adjacency matrix of an undirected graph is:

29. In a stochastic matrix:

30. The determinant of a matrix is equal to:

31. I want to find the shape of an object, with constant surface area, that holds the least water. What is the objective function?

32. A convex constraint is equivalent to a restriction to a portion of the parameter space:

33. An objective function is nonconvex, iff:

34. The feasible set in an optimisation problem is:

35. In an approximation problem, we'd often have a loss function of the form:

36. The definition of an eigenvector is:

37. Simulated annealing uses what metaheuristic to help avoid getting trapped in local minima?

38. A hyperparameter of an optimisation algorithm is:

39. First-order optimisation requires that objective functions be:

40. The gradient vector \(\nabla L(\theta)\) is a vector which, at any given point \(\theta\) will:

41. Finite differences is not an effective approach to apply first-order optimisation because:

42. Ant colony optimisation applies which two metaheuristics to improve random local search?

43. For a multi-objective optimisation, Pareto optimality means that:

44. What property of a probability distribution always holds true?

45. Bayesians use probability as:

46. The conditional probability P(A|B) is defined to be: (\(\land\) means "and" and \(\lor\) means "or")

47. If I have a joint distribution over two random variables \(A\) and \(B\), \(P(A,B)\), how can I compute \(P(A)\)?

48. In an optimisation problem, a penalty function can be used to:

49. The entropy of a random variable \(H(X) = \sum_i -\log(P(x_i))P(x_i)\) is:

50. Bayes' Rule is:

51. The name for P(B|A) and P(A) in Bayes Rule are:

52. The expectation \(\mathbb{E}[X+1]\) for a discrete random variable \(X\) would be computed as:

53. Which of these is a statistic which is an estimator of a population parameter for a normal distribution?

54. Which of these is a nonparametric statistic?

55. The Nyquist limit \(f_n\) is equal to:

56. Decreasing the number of levels of amplitude quantization will have what affect on the sampled representation of a signal?

57. The exponential smooth is often used instead of a moving average because:

58. Aliasing is caused by sampling signals with:

59. Along with a way to evaluate the likelihood and prior at any parameter setting \(\theta\), what else does Metropolis-Hastings need to sample from the posterior distribution?

60. In medical device, if you had an initial heart/pulse rate p0 and a 1D vector of changes in pulse rates captured at evenly spaced intervals, delta_p, how would you compute p, the pulse rate at each of these times?