• 1
    Olukotun K, Nayfeh BA, Hammond L, Wilson K, Chang K. The case for a single-chip multiprocessor, International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’96), 1996; 2–11.
  • 2
    Larus J. Spending Moore's dividend. Communications of the ACM 2009; 52(5):62–69.
  • 3
    Agarwal A. The tile processor: a 64-core multicore for embedded processing, High Performance Embedded Computing Workshop, 2007.
  • 4
    Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26(1):80–113.
  • 5
    Kumar R, Tullsen DM, Jouppi NP, Ranganathan P. Heterogeneous chip multiprocessors. Computer 2005; 38(11):32–38.
  • 6
    Kahle JA, Day MN, Hofstee HP, Johns CR, Maeurer TR, Shippy D. Introduction to the Cell multiprocessor. IBM Journal of Research and Development 2005; 49(4/5):589–604.
  • 7
    Fedorova A, Saez JC, Shelepov D, Prieto M. Maximizing power efficiency with asymmetric multicore systems. Communications of the ACM 2009; 52(12):48–57.
  • 8
    Greenhalgh P. Big.LITTLE processing with ARM Cortex-A15 & Cortex-A7, ARM Ltd., 2011.
  • 9
    Contreras G, Martonosi M. Characterizing and improving the performance of Intel Threading Building Blocks, IEEE International Symposium on Workload Characterization (IISWC 2008), 2008; 57–66.
  • 10
    Barrow-Williams N, Fensch C, Moore S. A communication characterisation of Splash-2 and Parsec, IEEE International Symposium on Workload Characterization (IISWC 2009), 2009; 86–97.
  • 11
    Trinder PW, Cole MI, Hammond K, Loidl HW, Michaelson GJ. Resource analyses for parallel and distributed coordination. Concurrency and Computation: Practice and Experience 2013; 25(3):309–348.
  • 12
    Lea D. A Java fork/join framework, Conference on Java Grande, 2000; 36–43.
  • 13
    Reinders J. Intel Threading Building Blocks: Outfitting C++ for Multi-core Processor Parallelism. O'Reilly, 2007.
  • 14
    Leijen D, Schulte W, Burckhardt S. The design of a task parallel library, Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA '09), 2009; 227–242.
  • 15
    McCool M, Robison AD, Reinders J. Structured Parallel Programming: Patterns for Efficient Computation. Morgan Kaufmann, 2012. ISBN 978-0-12-415993-8.
  • 16
    Cole M. Algorithmic Skeletons: Structured Management of Parallel Computation. MIT Press, 1991. ISBN 0-262-53086-4.
  • 17
    Dean J, Ghemawat S. MapReduce: simplified data processing on large clusters. Communications of the ACM 2008; 51(1):107–113.
  • 18
    Chamberlain BL, Callahan D, Zima HP. Parallel programmability and the Chapel language. International Journal of High Performance Computing Applications 2007; 21(3):291–312.
  • 19
    Charles P, Grothoff C, Saraswat V, Donawa C, Kielstra A, Ebcioglu K, von Praun C, Sarkar V. X10: an object-oriented approach to non-uniform cluster computing, Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA ’05), 2005; 519–538.
  • 20
    Steele GL, Jr. Parallel programming and parallel abstractions in Fortress, International Conference on Parallel Architectures and Compilation Techniques (PACT 2005), 2005; 157.
  • 21
    El-Ghazawi T, Carlson W, Sterling T, Yelick K. UPC: Distributed Shared Memory Programming. John Wiley and Sons, 2005. ISBN 0-471-22048-5.
  • 22
    Numrich RW, Reid J. Co-array Fortran for parallel programming. SIGPLAN Fortran Forum August 1998; 17:1–31.
  • 23
    Bird R, Wadler P. Introduction to Functional Programming. Prentice Hall, 1988.
  • 24
    Syme D, Granicz A, Cisternino A. Expert F#. Apress Academic, 2007. ISBN 1590598504.
  • 25
    Armstrong J, Virding R, Wikström C, Williams M. Concurrent Programming in Erlang, second edition. Prentice Hall, 1995. ISBN 978-0135083017.
  • 26
    Peyton Jones S. Haskell 98 Language and Libraries: the Revised Report. Cambridge University Press, 2003. ISBN 0521826144.
  • 27
    Odersky M, Altherr P, Cremet V, Dragos I, Dubochet G, Emir B, McDirmid S, Micheloud S, Mihaylov N, Schinz M, Stenman E, Spoon L, Zenger M. An overview of the Scala programming language. Technical Report LAMP-REPORT-2006-001, EPFL Lausanne, Switzerland, 2006. Second edition.
  • 28
    Prechelt L. An empirical comparison of seven programming languages. Computer 2000; 33(10):23–29.
  • 29
    Barnes J, Hut P. A hierarchical O(N log N) force-calculation algorithm. Nature 1986; 324(6096):446–449.
  • 30
    Cockshott P, Gdura Y, Keir P. Array languages and the N-body problem. Concurrency and Computation: Practice and Experience 2013; 26(4):935–951.
  • 31
    Šinkarovs A, Scholz SB, Bernecky R, Douma R, Grelck C. SAC/C formulations of the all-pairs N-body problem and their performance on SMPs and GPGPUs. Concurrency and Computation: Practice and Experience 2013; 26(4):952–971.
  • 32
    Goli M, González-Vélez H. N-body computations using skeletal frameworks on multi-core CPU and GPU architectures. Concurrency and Computation: Practice and Experience 2013; 26(4):972–986.
  • 33
    Totoo P, Loidl HW. Parallel Haskell implementations of the N-body problem. Concurrency and Computation: Practice and Experience 2013; 26(4):987–1019.