Abstract: The Banff classification for kidney allograft pathology has proved to be reproducible, but its inter and intraobserver agreement can vary substantially among centres. The aim of this study was to evaluate Banff reproducibility of surveillance renal allograft biopsies among renal pathologists from different transplant centres. This study included 32 renal transplant patients with stable graft function. Biopsies were performed 2 and 12 months post-transplant. Histology was interpreted according to the Banff schema by three renal pathologists, and inter and intraobserver agreement were measured. The best reproducibility was obtained for the presence or absence of acute rejection (AR), with kappa values ranging from moderate (κ = 0.47; p = 0.006) to good (κ = 0.72; p = 0.0001). However, the agreement for ‘suspicious for AR’ category was poor between all observers. For scoring and grading interstitial inflammation and intimal arteritis the agreement were poor and moderate, respectively. Reproducibility for the presence or absence of chronic allograft nephropathy (CAN) was heterogeneous, ranging from poor (κ = 0.13; p = NS) to moderate (κ = 0.56; p = 0.007). Scoring chronic changes such as fibrous intimal thickening gave a reasonable interobserver agreement. Intraobserver reproducibility was good for presence or absence of AR, but was poor for the diagnosis of CAN. In conclusion, histologic analysis of stable renal allografts based on Banff criteria showed a good agreement for the diagnosis of AR and a reasonable kappa for CAN, but reproducibility for scoring and grading showed a substantial interobserver variation.