Provide the routing scheme for the wireless ad hoc networks is somewhat difficult problem. Here, the authors propose distributed adaptive opportunistic routing scheme for multi hop wireless ad hoc networks. The proposed scheme utilizes a reinforcement learning framework to opportunistically route the packets even in the absence of reliable knowledge about channel statistics and network model. This scheme is shown to be optimal with respect to an expected average per-packet reward criterion. The proposed routing scheme jointly addresses the issues of learning and routing in an opportunistic context, where the network structure is characterized by the transmission success probabilities.