Energy balancing in multi-hop Wireless Sensor Networks: an approach based on reinforcement learning