The policy gradient theorem