Blood groups, finger prints, iris scans and DNA bar codes are several ways to distinguish or identify individual humans. According to Arumugam and colleagues (2011) we can now also be distinguished by the microbial communities in our faeces, our ‘enterotypes’.
The human gut is one of the most densely populated ecosystems known, and although this ecosystem contains members of the three domains of life – bacteria, archaea and eucarya (Finegold et al., 1983) – it is dominated by bacteria. They are essential in digesting the food we eat and in keeping us healthy by stimulating our immune system and fighting off pathogenic bacteria. It is estimated that 1013–1014 microbes inhabit our gastrointestinal tract (GIT), with the greatest number residing in the distal gut, where they synthesize essential vitamins and process otherwise indigestible components of our diet such as plant polysaccharides (Backhed et al., 2005). Early studies used 16S ribosomal DNA (rDNA) analysis to type and enumerate the distal gut and faecal microbiota. More than 90% of the bacterial phylotypes present in the intestinal microbiota in healthy humans are members of only three bacterial divisions: the Bacteroidetes, the Firmicutes and the Actinobacteria (Zoetendal et al., 2006).
Since the largest part (∼80%) of gut microbes remains uncultured, metagenomics analysis has become fashionable in the past 5 years to estimate the types, relative abundance and genome content of microbes in various parts of the GIT. The first metagenomics analysis from just two faecal samples (Gill et al., 2006) led to early insight into enrichment of genes encoding specific metabolic pathways, including metabolism associated with glycans, amino acids and xenobiotics, but also methanogenesis and biosynthesis of vitamins and isoprenoids. However, this early study provided only a very fragmented view due to limitations in sample size, sequencing technology, number of bases sequenced and availability of suitable reference genomes of gut inhabitants.
Gut microbial reference genomes
Large efforts have now been initiated to make catalogues of reference genomes from the human gut microbiome. Research groups from around the world have launched an International Human Microbiome Consortium (IHMC) which together aims to sequence more than 1000 human bacterial reference genomes, including 900 genomes from many body sites (but mainly the GIT) by the Human Microbiome Jumpstart Reference Strains Consortium (Nelson et al., 2010) (http://www.hmpdacc.org/project_catalog.html) and 100 gut genomes by the MetaHIT Consortium (http://www.sanger.ac.uk/resources/downloads/bacteria/metahit/). The MetaHIT sequencing project, coordinated by the Sanger Institute in the UK and funded by the European Commission, aims to sequence 30 strains of cultured gut bacteria, and 70 individual isolated cells of uncultured gut bacteria by single-cell genome sequencing (de Jager and Siezen, 2011).
Human gut microbial gene catalogues
Later metagenomics studies generated some 3 Gb of microbial sequence from faecal samples of 33 individuals from the USA and Japan (Kurokawa et al., 2007; Turnbaugh and Gordon, 2009; Turnbaugh et al., 2009). More recently, Illumina sequencing of DNA from faecal samples from 144 European adults generated 577 Gb of DNA sequence, almost 200 times more sequence than all previous studies (Qin et al., 2010). After assembly, a non-redundant set of 3.3 million gut ORFs was obtained which mapped well to 89 reference gut genomes. The cohort harboured a total of 1000–1150 prevalent bacterial species, and each individual had at least 160 of these species. This large data set allowed an in-depth assessment of the bacterial functions important for life in the gut, and gut-specific functions for adhesion to the host epithelial proteins and for harvesting sugars of the globoseries glycolipids (Qin et al., 2010).
Now, in a May 2011 issue of Nature, gut metagenomics data sets from 39 samples from six nationalities, including newly sequenced 22 European faecal samples, have been combined and the DNA sequence reads compared with 1511 reference genomes including 389 publicly available human microbiome genomes generated through the IHMC and MetaHIT consortia (Arumugam et al., 2011). Nearly 53% of the sequence reads could be robustly assigned to a genus in the reference genome set, and 80% could be assigned to a phylum. The phylogenetic composition confirmed that the Firmicutes and Bacteriodetes phyla are the most dominant in the human faecal samples. Multidimensional cluster analysis and principal component analysis (PCA) revealed that the gut samples separate into three robust clusters, designated as ‘enterotypes’. Each of the three enterotypes is identifiable by variation in the levels of one of three main genera: Bacteriodetes (enterotype 1), Prevotella (enterotype 2) and Ruminococcus (enterotype 3) (Fig. 1a). The same analysis performed on two larger published human microbiome data sets generated the same three enterotype clusters (Fig. 1b and c).
Phylogenetic and functional differences among enterotypes seem to reflect different combinations of microbial trophic chains. The abundances and co-occurrence networks of the three enterotypes from the Sanger metagenomes are shown in Fig. 2. The drivers of enterotype 1 (mainly Bacteriodetes, co-occurring with, e.g. Parabacteriodetes) appear to derive energy mainly from fermentation of carbohydrates as these genera have a very broad saccharolytic potential. In enterotype 2, the Prevotella and co-occurring Desulfovibrio can act in synergy to degrade mucin glycoproteins of the gut mucosal layer, while in enterotype 3 the Ruminococcus and Akkermansia can bind mucins and transport and degrade the constituent sugars. Enterotypes 1 and 2 are also enriched in biosynthesis of different vitamins (Arumugam et al., 2011). Surprisingly, the enterotypes do not correlate with host properties such as nationality, age, gender or body mass index. However, data-driven marker genes or functional modules were identified for most of these host properties.
This study indicates the existence of a limited number of well-balanced host–microbial symbiotic states in human faeces that presumably reflect the microbial composition in the large intestine. Perhaps it would have been wiser to call these ‘faecotypes’ than ‘enterotypes’, since it is well known that the microbial abundance and composition changes dramatically throughout the GIT. It remains to be seen whether such stable and universal microbial consortia are also found in the small intestine where the genera Streptococcus, Clostridium and Veillonella are dominant (Booijink et al., 2007; 2010a,b; van den Bogert et al., 2011).
This is an exciting field of research, where much is still to be learned about how stable enterotypes are formed and maintained in humans and other animals, and whether they can be influenced by diets, probiotics, drugs or diseases (Kau et al., 2011; Walter and Ley, 2011).
The research that led to the discovery of the enterotypes was in part funded by the European Community's Seventh Framework Programme (FP7/2007-2013) in the project MetaHIT; grant agreement HEALTH-F4-2007-201052.