Policy Optimization for Markov Games: Unified Framework and Faster Convergence