A Machine Learning Specialist is implementing a full Bayesian network on a dataset that describes public transit in New York City.One of the random variables is discrete and represents the number of minutes New Yorkers wait for a bus given that the buses cycle every 10 minutes, with a mean of 3 minutes.Which prior probability distribution should the ML Specialist use for this variable?
A Machine Learning Specialist is preparing data for training on Amazon SageMaker.The Specialist is using one of the SageMaker built-in algorithms for the training.The dataset is stored in .CSV format and is transformed into a numpy.array, which appears to be negatively affecting the speed of the training.What should the Specialist do to optimize the data for training on SageMaker?