Inductive Bias-based Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters