Unsupervised State Embedding & Aggregation Towards Scalable Reinforcement Learning