Now at mental images GmbH, Berlin, Germany.
STXXL: standard template library for XXL data sets
Article first published online: 6 AUG 2007
Copyright © 2007 John Wiley & Sons, Ltd.
Software: Practice and Experience
Volume 38, Issue 6, pages 589–637, May 2008
How to Cite
Dementiev, R., Kettner, L. and Sanders, P. (2008), STXXL: standard template library for XXL data sets. Softw: Pract. Exper., 38: 589–637. doi: 10.1002/spe.844
- Issue published online: 4 APR 2008
- Article first published online: 6 AUG 2007
- Manuscript Accepted: 3 JUN 2007
- Manuscript Revised: 30 MAY 2007
- Manuscript Received: 26 JAN 2007
- DFG. Grant Number: SA 933/1-2
- very large data sets;
- software library;
- C++ standard template library;
- algorithm engineering
We present the software library STXXL that is an implementation of the C++ standard template library (STL) for processing huge data sets that can fit only on hard disks. It supports parallel disks, overlapping between disk I/O and computation and it is the first I/O-efficient algorithm library that supports the pipelining technique that can save more than half of the I/Os. STXXL has been applied both in academic and industrial environments for a range of problems including text processing, graph algorithms, computational geometry, Gaussian elimination, visualization, and analysis of microscopic images, differential cryptographic analysis, etc. The performance of STXXL and its applications are evaluated on synthetic and real-world inputs. We present the design of the library, how its performance features are supported, and demonstrate how the library integrates with STL. Copyright © 2007 John Wiley & Sons, Ltd.