• GPU;
• molecular mechanics;
• MMFF94;
• cell lists;
• CUDA

In this article, we describe an improved cell-list approach designed to match the Kepler architecture of general-purpose graphics processing units (GPGPUs). We explain how our approach improves load balancing for the cell-list algorithm and how warp intrinsics are used to implement Newton's third law for the nonbonded force calculations. We also describe our approach to exclusion handling, together with a method to calculate bonded forces and 1–4 electrostatic scaling using a single CUDA kernel. Performance benchmarks in the last sections show the linear scaling of our implementation using a step minimization method. Additional benchmarks demonstrate the contribution of the individual optimizations used in our implementation. © 2013 Wiley Periodicals, Inc.
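
As a point of reference for the cell-list idea and the use of Newton's third law mentioned in the abstract, the following is a minimal CPU sketch in Python. It is not the GPU implementation described in the article: there are no warps, no exclusions, and no periodic boundaries. The function names (`build_cells`, `pair_force`, `nonbonded_forces`) and the purely repulsive 1/r² pair force are assumptions chosen for illustration, and particle positions are assumed to be distinct. The sketch only shows that each pair within the cutoff is evaluated once and that the resulting force is applied with opposite signs to both particles.

```python
import itertools
import math
from collections import defaultdict


def build_cells(positions, cutoff):
    """Bin particle indices into cubic cells whose edge equals the cutoff."""
    cells = defaultdict(list)
    for i, (x, y, z) in enumerate(positions):
        key = (math.floor(x / cutoff),
               math.floor(y / cutoff),
               math.floor(z / cutoff))
        cells[key].append(i)
    return cells


def pair_force(rij, r2):
    """Hypothetical repulsive force on particle i due to j, magnitude 1/r^2."""
    r = math.sqrt(r2)
    scale = 1.0 / (r2 * r)  # unit vector (rij / r) times 1 / r^2
    return tuple(scale * c for c in rij)


def nonbonded_forces(positions, cutoff):
    """Accumulate pair forces using cell lists and Newton's third law."""
    cells = build_cells(positions, cutoff)
    forces = [[0.0, 0.0, 0.0] for _ in positions]
    cutoff2 = cutoff * cutoff
    for (cx, cy, cz), members in cells.items():
        # Visit the cell itself and its 26 neighbours.
        for dx, dy, dz in itertools.product((-1, 0, 1), repeat=3):
            neighbours = cells.get((cx + dx, cy + dy, cz + dz))
            if not neighbours:
                continue
            for i in members:
                for j in neighbours:
                    if j <= i:
                        continue  # evaluate each unordered pair only once
                    rij = tuple(a - b for a, b in zip(positions[i], positions[j]))
                    r2 = sum(c * c for c in rij)
                    if r2 >= cutoff2:
                        continue
                    f = pair_force(rij, r2)
                    for k in range(3):
                        forces[i][k] += f[k]  # force on i
                        forces[j][k] -= f[k]  # equal and opposite force on j
    return forces
```

Because every pair contributes `+f` to one particle and `-f` to the other, the forces sum to zero over the whole system, which is a convenient sanity check. On a GPU the same idea needs extra care, since several threads may update the same particle's force at once.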