Search Results

There are 13,110 results for content related to: A comparison of GPU strategies for unstructured mesh physics

  1. Iterative sparse matrix–vector multiplication for accelerating the block Wiedemann algorithm over GF(2) on multi-graphics processing unit systems

    Concurrency and Computation: Practice and Experience

    Volume 25, Issue 4, 2013, Pages: 586–603, Bertil Schmidt, Hans Aribowo and Hoang-Vu Dang

    Version of Record online : 3 JUL 2012, DOI: 10.1002/cpe.2896

  2. Pricing derivatives on graphics processing units using Monte Carlo simulation

    Concurrency and Computation: Practice and Experience

    Volume 26, Issue 9, 25 June 2014, Pages: 1679–1697, L.A. Abbas-Turki, S. Vialle, B. Lapeyre and P. Mercier

    Version of Record online : 24 MAY 2012, DOI: 10.1002/cpe.2862

  3. Acceleration of option pricing technique on graphics processing units

    Concurrency and Computation: Practice and Experience

    Volume 26, Issue 9, 25 June 2014, Pages: 1626–1639, Bowen Zhang and Cornelis W. Oosterlee

    Version of Record online : 6 FEB 2012, DOI: 10.1002/cpe.2825

  4. CPU–GPU hybrid parallel strategy for cosmological simulations

    Concurrency and Computation: Practice and Experience

    Volume 26, Issue 3, 10 March 2014, Pages: 748–765, Yueqing Wang, Yong Dou, Song Guo, Yuanwu Lei and Dan Zou

    Version of Record online : 14 MAY 2013, DOI: 10.1002/cpe.3046

  5. Stepwise-refinement for performance: a methodology for many-core programming

    Concurrency and Computation: Practice and Experience

    Volume 27, Issue 17, 10 December 2015, Pages: 4515–4554, P. Hijma, R. V. van Nieuwpoort, C. J. H. Jacobs and H. E. Bal

    Version of Record online : 27 JAN 2015, DOI: 10.1002/cpe.3416

  6. Improving the user experience of the rCUDA remote GPU virtualization framework

    Concurrency and Computation: Practice and Experience

    Volume 27, Issue 14, 25 September 2015, Pages: 3746–3770, Carlos Reaño, Federico Silla, Adrián Castelló, Antonio J. Peña, Rafael Mayo, Enrique S. Quintana-Ortí and José Duato

    Version of Record online : 10 OCT 2014, DOI: 10.1002/cpe.3409

  7. Enhancing performance and energy consumption of runtime schedulers for dense linear algebra

    Concurrency and Computation: Practice and Experience

    Volume 26, Issue 15, October 2014, Pages: 2591–2611, Pedro Alonso, Manuel F. Dolz, Francisco D. Igual, Rafael Mayo and Enrique S. Quintana-Ortí

    Version of Record online : 25 JUN 2014, DOI: 10.1002/cpe.3317

  8. Methods for multitasking among real-time embedded compute tasks running on the GPU

    Concurrency and Computation: Practice and Experience

    Volume 29, Issue 15, 10 August 2017, Pınar Muyan-Özçelik and John D. Owens

    Version of Record online : 5 JUN 2017, DOI: 10.1002/cpe.4118

  9. IVM-based parallel branch-and-bound using hierarchical work stealing on multi-GPU systems

    Concurrency and Computation: Practice and Experience

    Volume 29, Issue 9, 10 May 2017, J. Gmys, M. Mezmaz, N. Melab and D. Tuyttens

    Version of Record online : 25 OCT 2016, DOI: 10.1002/cpe.4019

  10. On the benefits of the remote GPU virtualization mechanism: The rCUDA case

    Concurrency and Computation: Practice and Experience

    Volume 29, Issue 13, 10 July 2017, Federico Silla, Sergio Iserte, Carlos Reaño and Javier Prades

    Version of Record online : 8 FEB 2017, DOI: 10.1002/cpe.4072

  11. A scalable approach to solving dense linear algebra problems on hybrid CPU-GPU systems

    Concurrency and Computation: Practice and Experience

    Volume 27, Issue 14, 25 September 2015, Pages: 3702–3723, Fengguang Song and Jack Dongarra

    Version of Record online : 1 OCT 2014, DOI: 10.1002/cpe.3403

  12. Parallel resolution of the 3D Helmholtz equation based on multi-graphics processing unit clusters

    Concurrency and Computation: Practice and Experience

    Volume 27, Issue 13, 10 September 2015, Pages: 3205–3219, Gloria Ortega, Julia Lobera, Inmaculada García, M. Pilar Arroyo and Ester M. Garzón

    Version of Record online : 5 FEB 2014, DOI: 10.1002/cpe.3212

  13. Systematic adaptation of stencil-based 3D MPDATA to GPU architectures

    Concurrency and Computation: Practice and Experience

    Volume 29, Issue 9, 10 May 2017, Krzysztof Rojek, Roman Wyrzykowski and Lukasz Kuczynski

    Version of Record online : 16 SEP 2016, DOI: 10.1002/cpe.3970

  14. Compiler and runtime support for enabling reduction computations on heterogeneous systems

    Concurrency and Computation: Practice and Experience

    Volume 24, Issue 5, 10 April 2012, Pages: 463–480, Vignesh T. Ravi, Wenjing Ma, David Chiu and Gagan Agrawal

    Version of Record online : 2 OCT 2011, DOI: 10.1002/cpe.1848

  15. Work stealing for GPU-accelerated parallel programs in a global address space framework

    Concurrency and Computation: Practice and Experience

    Volume 28, Issue 13, 10 September 2016, Pages: 3637–3654, Humayun Arafat, James Dinan, Sriram Krishnamoorthy, Pavan Balaji and P. Sadayappan

    Version of Record online : 6 JAN 2016, DOI: 10.1002/cpe.3747

  16. Adaptation of fluid model EULAG to graphics processing unit architecture

    Concurrency and Computation: Practice and Experience

    Volume 27, Issue 4, 25 March 2015, Pages: 937–957, Krzysztof Andrzej Rojek, Milosz Ciznicki, Bogdan Rosa, Piotr Kopta, Michal Kulczewski, Krzysztof Kurowski, Zbigniew Pawel Piotrowski, Lukasz Szustak, Damian Karol Wojcik and Roman Wyrzykowski

    Version of Record online : 14 OCT 2014, DOI: 10.1002/cpe.3417

  17. Profiling divergences in GPU applications

    Concurrency and Computation: Practice and Experience

    Volume 25, Issue 6, 25 April 2013, Pages: 775–789, Bruno Coutinho, Diogo Sampaio, Fernando M. Q. Pereira and Wagner Meira Jr.

    Version of Record online : 12 JUN 2012, DOI: 10.1002/cpe.2853

  18. Graphics processing unit pricing of exotic cross-currency interest rate derivatives with a foreign exchange volatility skew model

    Concurrency and Computation: Practice and Experience

    Volume 26, Issue 9, 25 June 2014, Pages: 1609–1625, Duy Minh Dang, Christina C. Christara and Kenneth R. Jackson

    Version of Record online : 2 MAY 2012, DOI: 10.1002/cpe.2824

  19. Efficient parallel implementation of three-point Viterbi decoding algorithm on CPU, GPU, and FPGA

    Concurrency and Computation: Practice and Experience

    Volume 26, Issue 3, 10 March 2014, Pages: 821–840, Rongchun Li, Yong Dou and Dan Zou

    Version of Record online : 11 JUL 2013, DOI: 10.1002/cpe.3093

  20. Manycore GPU processing of repeated range queries over streams of moving objects observations

    Concurrency and Computation: Practice and Experience

    Volume 29, Issue 4, 25 February 2017, Francesco Lettich, Salvatore Orlando, Claudio Silvestri and Christian S. Jensen

    Version of Record online : 30 JUN 2016, DOI: 10.1002/cpe.3881