In this talk, we shall describe the NUMA atmospheric model, focusing in particular on its unified continuous/discontinuous (CG and DG) Galerkin numerical methods that are used to represent the spatial derivatives. We shall describe how these two methods are formulated in a unified approach and the advantages that this brings. We will also report on the progress in extending NUMA to using adaptive mesh refinement. Lastly, we will report on the scalability and performance of NUMA on the leadership computing facilities (LCF) of the Department of Energy where we have scaled NUMA to over 3 million MPI threads achieving a 99% strong scaling efficiency and on 2^14 GPUs on Titan at Oakridge achieving 94% weak scaling.