* add tf metrics
* comments
* fix network scopes
* add doc
* use format string
* fix trace level
* plot intermediate and final sgd stats
* add back a global step
* pong on pixels working (not cleaned up)
* make training compatible with all atari games
* cartpole runs
* Update documentation and usage for policy gradients.