SEARCH

SEARCH BY CITATION

References

  • Abbeel, P., A. Coates, M. Quigley and A.Y. Ng (2007) An application of reinforcement learning to aerobatic helicopter flight, in Proceedings of the Advances in Neural Information Processing, NIPS'07.
  • Ahn H., M. Gumus and P. Kaminsky (2007) Pricing and manufacturing decisions when demand is a function of price in multiple period, Operations Research, 55, 10391057
  • Banerjee, B. and P. Stone (2007) General game learning using knowledge transfer. in The 20th International Joint Conference on Artificial Intelligence, 672677.
  • Bazzan, A.L.C., D. De oliveira and B.C. Da Silva (2010) Learning in groups of traffic signals, Engineering Applications of Artificial Intelligence, 23, 560568
  • Chaharsooghi, S.K., J. Heydari and S.H. Zegordi (2008) A reinforcement learning model for supply chain ordering management: an application to the beer game, Decision Support Systems, 45, 949959.
  • Chan L.M.A., Z.J. Max Shen, D. Simchi-Levi and J. Swann (2004) Coordination of Pricing and Inventory Decisions: A Survey and Classification, in Handbook of Quantitative Supply Chain Analysis: Modelling in the E-Business Era, Boston: Kluwer.
  • Darken, C., J. Chang and J. Moody (1992) Learning Rate Schedules for Faster Stochastic Gradient Search, Neural Networks for Signal Processing, in Proceedings of the 1992 IEEE Workshop, Piscataway, NJ: IEEE Press.
  • Eliashberg, J., Steinberg, R., (1993) Marketing-Production Joint Decision-Making. In: Eliashberg, J., Lilien, G.L. (Eds.), Handbooks in Operations Research and Management Science, Vol. 5. North Holland: Amsterdam, 827880.
  • Elmaghraby, W. and P. Keskinocak (2003) Dynamic pricing in the presence of inventory considerations, Management Science, 49, 12871309.
  • Ferreira, K.D. and D.D. Wu (2011) An integrated product planning model for pricing and bundle selection using Markov decision processes and data envelope analysis, International Journal of Production Economics, 134-1, 95107.
  • Han, W., L. Liu and H. Zheng (2008) Dynamic pricing by multi-agent reinforcement learning, International Symposium on Electronic Commerce and Security, 226229.
  • Jiang, C. and Z. Sheng (2009) Case-based reinforcement learning for dynamic inventory control in a multi-agent supply-chain system, Expert Systems with Applications, 36, 65206526.
  • Kaelbling L.P., M.L. Littman and A.W. Moore (1996) Reinforcement learning: a survey, J. Artificial Intelligence Research, 4, 237285.
  • Kwon, I.-H., C.O. Kim, J. Jun and J.H. Lee (2008) Case-based myopic reinforcement learning for satisfying target service level in supply chain, Expert Systems with Applications, 35, 389397.
  • Li X. and J. Wang, R. Sawhney (2012) Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems, European Journal of Operational Research, 221-1, 99109.
  • Littman, M.L. (1994) Markov Games as a Framework for Multi-Agent Reinforcement Learning, in Proceedings of the Eleventh International Conference on Machine Learning, San Mateo, CA: Morgan Kaufman, 157163.
  • Raju, C.V.L., Y. Narahari and K. Ravikumar (2003) Reinforcement learning applications in dynamic pricing of retail markets, in Proceedings of the IEEE International Conference on E-Commerce (CEC'03), 339.
  • Ravulapati, K.K., J. Rao and T.K. Das (2004) A reinforcement learning approach to stochastic business games, IIE Transactions, 36, 373385.
  • Serel D.A. (2008) Inventory and pricing decisions in a single-period problem involving risky supply, International Journal of Production Economics, 116, 115128.
  • Shoham Y., R. Powers and T. Grenager (2007) If multi-agent learning is the answer, what is the question, Artificial Intelligence, 171, 365377.
  • Sutton, R.S. and A.G. Barto (1998) Reinforcement Learning: An Introduction, Cambridge, MA: MIT Press.
  • Talluri, K. and G. Van Ryzin (2004) Revenue management under a general discrete choice model of consumer behaviour, Management Science 50, 1533.
  • Tesauro G. and J.O. Kephart (2002) Pricing in agent economies using multi-agent Q-learning, Autonomous Agents and Multi-Agent Systems, 5, 289.
  • Tesauro G., J. Kephart, (1999) Pricing in Agent Economies Using Multi-Agent Q-Learning, in Proceedings of Fifth European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty, London, UK.
  • Train K. (2009) Discrete Choice Methods with Simulation. Cambridge: Cambridge University Press, 2nd ed.
  • Watkins, C.J.C.H. (1989) Learning from delayed rewards. PhD thesis, Cambridge University, Cambridge, England.
  • Wei J. and J. Zhao (2011), Pricing decisions with retail competition in a fuzzy closed-loop supply chain, Expert Systems with Applications, 38, 1120911216.
  • Wu D., X. Kefan, L. Hua, Z. Shi and D.L. Olson (2010) Modeling technological innovation risks of an entrepreneurial team using system dynamics: an agent-based perspective, Technology Forecasting and Social Change, 77, 857869.
  • Wu D. and C.G. Lee (2010) Stochastic DEA with ordinal data applied to a multi-attribute pricing problem, European Journal of Operational Research, 207, 16791688.
  • Xie, M. and J. Chen (2004) Studies on horizontal competition among homogeneous retailers through agent based simulation, Journal of Systems Science and Systems Engineering, 13, 490505.
  • Yano, C.A., and S.M. Gilbert (2004). Coordinated Pricing and Production/Procurement Decisions: A Review. In: Chakravarty, A., Eliashberg, J. (Eds.), Managing Business Interfaces: Marketing, Engineering and Manufacturing Perspectives. Kluwer Academic Publishers, Boston, MA.