Research Article
Adaptive quality of service-based routing approaches: development of neuro-dynamic state-dependent reinforcement learning algorithms
Article first published online: 15 NOV 2006
DOI: 10.1002/dac.858
Copyright © 2006 John Wiley & Sons, Ltd.
Issue
1099-1131/asset/cover.gif?v=1&s=6803fd5765ad45fe5d94eb628e0c9b9deb3b4fe8)
International Journal of Communication Systems
Volume 20, Issue 10, pages 1113–1130, October 2007
Additional Information
How to Cite
Mellouk, A., Hoceïni, S. and Amirat, Y. (2007), Adaptive quality of service-based routing approaches: development of neuro-dynamic state-dependent reinforcement learning algorithms. International Journal of Communication Systems, 20: 1113–1130. doi: 10.1002/dac.858
Publication History
- Issue published online: 13 SEP 2007
- Article first published online: 15 NOV 2006
- Manuscript Accepted: 7 SEP 2006
- Manuscript Revised: 22 JUN 2006
- Manuscript Received: 14 SEP 2005
- Abstract
- References
- Cited By
Keywords:
- shortest-path routing;
- flow-based routing;
- N-best paths;
- neural networks;
- adaptive routing;
- neuro-dynamic state-dependent;
- reinforcement learning;
- traffic engineering
Abstract
In this paper, we propose two adaptive routing algorithms based on reinforcement learning. In the first algorithm, we have used a neural network to approximate the reinforcement signal, allowing the learner to take into account various parameters such as local queue size, for distance estimation. Moreover, each router uses an online learning module to optimize the path in terms of average packet delivery time, by taking into account the waiting queue states of neighbouring routers. In the second algorithm, the exploration of paths is limited to N-best non-loop paths in terms of hops number (number of routers in a path), leading to a substantial reduction of convergence time. The performances of the proposed algorithms are evaluated experimentally with OPNET simulator for different levels of traffic's load and compared with standard shortest-path and Q-routing algorithms. Our approach proves superior to classical algorithms and is able to route efficiently even when the network load varies in an irregular manner. We also tested our approach on a large network topology to proof its scalability and adaptability. Copyright © 2006 John Wiley & Sons, Ltd.

1099-1131/asset/olbannerleft.gif?v=1&s=e6a2deb39025c8e261611ec20dac96a89cac134e)