The awake complexity of a distributed algorithm measures the number of rounds in which a node is awake. When a node is not awake, it is sleeping and does not do any computation or communication and spends very little resources. Reducing the awake complexity of a distributed algorithm can be relevant in resource-constrained networks such as sensor networks, where saving energy of nodes is crucial. Awake complexity of many fundamental problems such as maximal independent set, maximal matching, coloring, and spanning trees have been studied recently. In this work, we study the awake complexity of the fundamental distributed minimum spanning tree (MST) problem and present the following results. - Lower Bounds. 1. We show a lower bound of Omega(log n) (where n is the number of nodes in the network) on the awake complexity for computing an MST that holds even for randomized algorithms. 2. To better understand the relationship between the awake complexity and the round complexity (which counts both awake and sleeping rounds), we also prove a trade-off lower bound of (Omega) over tilde (n) (throughout, the (O) over tilde notation hides a polylog n factor and (Omega) over tilde hides a 1/(polylog n) factor) on the product of round complexity and awake complexity for any distributed algorithm (even randomized) that outputs an MST. Our lower bound is shown for graphs having diameter ranging from (Omega) over tilde(root n) to (Omega) over tilde (n). - Awake-Optimal Algorithms. 1. We present a distributed randomized algorithm to find an MST that achieves the optimal awake complexity of O(log n) (with high probability). Its round complexity is O(n log n). We note that by our trade-off lower bound, in general (in terms of n), this is the best round complexity (up to logarithmic factors) for an awake-optimal algorithm. 2. We also show that the O(log n) awake complexity bound can be achieved deterministically as well, by presenting a distributed deterministic algorithm that has O(log n) awake complexity and O(n log(5) n) round complexity. We also show how to reduce the round complexity to O(n log n log* n) at the expense of a slightly increased awake complexity of O(log n log* n). - Trade-Off Algorithms. To complement our trade-off lower bound, we present a parameterized family of distributed algorithms that gives an essentially optimal trade-off (up to polylog n factors) between the awake complexity and the round complexity. Specifically we show a family of distributed algorithms that find an MST of the given graph with high probability in (O) over tilde (D+ 2(k) + n/2(k)) round complexity and (O) over tilde (n/2(k)) awake complexity, where D is the network diameter and integer k is an input parameter to the algorithm. When k is an element of [max{[0.5 log n], [log D]}, [log n]], we can obtain useful tradeoffs. Our work is a step towards understanding resource-efficient distributed algorithms for fundamental global problems such as MST. It shows that MST can be computed with any node being awake (and hence spending resources) for only O(log n) rounds which is significantly better than the fundamental lower bound of (Omega) over tilde (Diameter(G) + root n) rounds for MST in the traditional CONGEST model, where nodes can be active for at least so many rounds.