University of Rochester College of Business. This paper has benefited from comments and criticisms by G. Benston, E. Fama, J. Keilson, H. Weingartner, and especially M. Scholes.
THE PERFORMANCE OF MUTUAL FUNDS IN THE PERIOD 1945–1964
Article first published online: 30 APR 2012
1968 The American Finance Association
The Journal of Finance
Volume 23, Issue 2, pages 389–416, May 1968
How to Cite
Jensen, M. C. (1968), THE PERFORMANCE OF MUTUAL FUNDS IN THE PERIOD 1945–1964. The Journal of Finance, 23: 389–416. doi: 10.1111/j.1540-6261.1968.tb00815.x
- Issue published online: 30 APR 2012
- Article first published online: 30 APR 2012
A CENTRAL PROBLEM IN FINANCE (and especially portfolio management) has been that of evaluating the “performance” of portfolios of risky investments. The concept of portfolio “performance” has at least two distinct dimensions:
- 1)The ability of the portfolio manager or security analyst to increase returns on the portfolio through successful prediction of future security prices, and
- 2)The ability of the portfolio manager to minimize (through “efficient” diversification) the amount of “insurable risk” born by the holders of the portfolio.
The major difficulty encountered in attempting to evaluate the performance of a portfolio in these two dimensions has been the lack of a thorough understanding of the nature and measurement of “risk.” Evidence seems to indicate a predominance of risk aversion in the capital markets, and as long as investors correctly perceive the “riskiness” of various assets this implies that “risky” assets must on average yield higher returns than less “risky” assets.1 Hence in evaluating the “performance” of portfolios the effects of differential degrees of risk on the returns of those portfolios must be taken into account.
Recent developments in the theory of the pricing of capital assets by Sharpe , Lintner  and Treynor  allow us to formulate explicit measures of a portfolio's performance in each of the dimensions outlined above. These measures are derived and discussed in detail in Jensen . However, we shall confine our attention here only to the problem of evaluating a portfolio manager's predictive ability—that is his ability to earn returns through successful prediction of security prices which are higher than those which we could expect given the level of riskiness of his portfolio. The foundations of the model and the properties of the performance measure suggested here (which is somewhat different than that proposed in ) are discussed in Section II. The model is illustrated in Section III by an application of it to the evaluation of the performance of 115 open end mutual funds in the period 1945–1964.
A number of people in the past have attempted to evaluate the performance of portfolios2 (primarily mutual funds), but almost all of these authors have relied heavily on relative measures of performance when what we really need is an absolute measure of performance. That is, they have relied mainly on procedures for ranking portfolios. For example, if there are two portfolios A and B, we not only would like to know whether A is better (in some sense) than B, but also whether A and B are good or bad relative to some absolute standard. The measure of performance suggested below is such an absolute measure.3 It is important to emphasize here again that the word “performance” is used here only to refer to a fund manager's forecasting ability. It does not refer to a portfolio's “efficiency” in the Markowitz-Tobin sense. A measure of “efficiency” and its relationship to certain measures of diversification and forecasting ability is derived and discussed in detail in Jensen . For purposes of brevity we confine ourselves here to an examination of a fund manager's forecasting ability which is of interest in and of itself (witness the widespread interest in the theory of random walks and its implications regarding forecasting success).
In addition to the lack of an absolute measure of performance, these past studies of portfolio performance have been plagued with problems associated with the definition of “risk” and the need to adequately control for the varying degrees of riskiness among portfolios. The measure suggested below takes explicit account of the effects of “risk” on the returns of the portfolio.
Finally, once we have a measure of portfolio “performance” we also need to estimate the measure's sampling error. That is we want to be able to measure its “significance” in the usual statistical sense. Such a measure of significance also is suggested below.
II. The Model
The Foundations of the Model.—As mentioned above, the measure of portfolio performance summarized below is derived from a direct application of the theoretical results of the capital asset pricing models derived independently by Sharpe , Lintner  and Treynor . All three models are based on the assumption that (1) all investors are averse to risk, and are single period expected utility of terminal wealth maximizers, (2) all investors have identical decision horizons and homogeneous expectations regarding investment opportunities, (3) all investors are able to choose among portfolios solely on the basis of expected returns and variance of returns, (4) all transactions costs and taxes are zero, and (5) all assets are infinitely divisible. Given the additional assumption that the capital market is in equilibrium, all three models yield the following expression for the expected one period return,4 E(R̃j), on any security (or portfolio) j:
where the tildes denote random variables, and
RF = the one-period risk free interest rate.
βj = = the measure of risk (hereafter called systematic risk) which the asset pricing model implies is crucial in determining the prices of risky assets.
E(R̃M) = the expected one-period return on the “market portfolio” which consists of an investment in each asset in the market in proportion to its fraction of the total value of all assets in the market.
Thus eq. (1) implies that the expected return on any asset is equal to the risk free rate plus a risk premium given by the product of the systematic risk of the asset and the risk premium on the market portfolio.5 The risk premium on the market portfolio is the difference between the expected returns on the market portfolio and the risk free rate.
Equation (1) then simply tells us what any security (or portfolio) can be expected to earn given its level of systematic risk, βj. If a portfolio manager or security analyst is able to predict future security prices he will be able to earn higher returns that those implied by eq. (1) and the riskiness of his portfolio. We now wish to show how (1) can be adapted and extended to provide an estimate of the forecasting ability of any portfolio manager. Note that (1) is stated in terms of the expected returns on any security or portfolio j and the expected returns on the market portfolio. Since these expectations are strictly unobservable we wish to show how (1) can be recast in terms of the objectively measurable realizations of returns on any portfolio j and the market portfolio M.
In  it was shown that the single period models of Sharpe, Lintner, and Treynor can be extended to a multiperiod world in which investors are allowed to have heterogeneous horizon periods and in which the trading of securities takes place continuously through time. These results indicate that we can generalize eq. (1) and rewrite it as
where the subscript t denotes an interval of time arbitrary with respect to length and starting (and ending) dates.
where bj is a parameter which may vary from security to security and is an unobservable “market factor” which to some extent affects the returns on all securities, and N is the total number of securities in the market.6 The variables and the are assumed to be independent normally distributed random variables with
It is also shown in  that the linear relationships of eqs. (1a) and (2) hold for any length time interval as long as the returns are measured as continuously compounded rates of return. Furthermore to a close approximation the return on the market portfolio can be expressed as7
Since evidence given in [1, 11] indicates that the market model, given by eqs. (2) and (3a) ≅ (3d), holds for portfolios as well as individual securities, we can use (2) to recast (1a) in terms of ex post returns.8 Substituting for E(R̃Mt) in (1a) from (4) and adding to both sides of (1a) we have
Thus assuming that the asset pricing model is empirically valid,10 eq. (6) says that the realized returns on any security or portfolio can be expressed as a linear function of its systematic risk, the realized returns on the market portfolio, the risk free rate and a random error, , which has an expected value of zero. The term RFt can be subtracted from both sides of eq. (6), and since its coefficient is unity the result is
The left hand side of (7) is the risk premium earned on the j'th portfolio. As long as the asset pricing model is valid this premium is equal to βj[R̃Mt − RFt] plus the random error term .
The Measure of Performance.—Furthermore eq. (7) may be used directly for empirical estimation. If we wish to estimate the systematic risk of any individual security or of an unmanaged portfolio the constrained regression estimate of βj in eq. (7) will be an efficient estimate11 of this systematic risk. However, we must be very careful when applying the equation to managed portfolios. If the manager is a superior forecaster (perhaps because of special knowledge not available to others) he will tend to systematically select securities which realize > 0. Hence his portfolio will earn more than the “normal” risk premium for its level of risk. We must allow for this possibility in estimating the systematic risk of a managed portfolio.
Allowance for such forecasting ability can be made by simply not constraining the estimating regression to pass through the origin. That is, we allow for the possible existence of a non-zero constant in eq. (7) by using eq. (8) as the estimating equation.
The new error term ũjt will now have E(ũjt) = 0, and should be serially independent.12
Thus if the portfolio manager has an ability to forecast security prices, the intercept, αj, in eq. (8) will be positive. Indeed, it represents the average incremental rate of return on the portfolio per unit time which is due solely to the manager's ability to forecast future security prices. It is interesting to note that a naive random selection buy and hold policy can be expected to yield a zero intercept. In addition if the manager is not doing as well as a random selection buy and hold policy, αj will be negative. At first glance it might seem difficult to do worse than a random selection policy, but such results may very well be due to the generation of too many expenses in unsuccessful forecasting attempts.
However, given that we observe a positive intercept in any sample of returns on a portfolio we have the difficulty of judging whether or not this observation was due to mere random chance or to the superior forecasting ability of the portfolio manager. Thus in order to make inferences regarding the fund manager's forecasting ability we need a measure of the standard error of estimate of the performance measure. Least squares regression theory provides an estimate of the dispersion of the sampling distribution of the intercept αj. Furthermore, the sampling distribution of the estimate, is a student t distribution with nj−2 degrees of freedom. These facts give us the information needed to make inferences regarding the statistical significance of the estimated performance measure.
It should be emphasized that in estimating αj; the measure of performance, we are explicitly allowing for the effects of risk on return as implied by the asset pricing model. Moreover, it should also be noted that if the model is valid, the particular nature of general economic conditions or the particular market conditions (the behavior of π) over the sample or evaluation period has no effect whatsoever on the measure of performance. Thus our measure of performance can be legitimately compared across funds of different risk levels and across differing time periods irrespective of general economic and market conditions.
The Effects of Non-Stationarity of the Risk Parameter.—It was pointed out earlier13 that by omitting the time subscript from αj (the risk parameter in eq. (8) we were implicitly assuming the risk level of the portfolio under consideration is stationary through time. However, we know this need not be strictly true since the portfolio manager can certainly change the risk level of his portfolio very easily. He can simply switch from more risky to less risky equities (or vice versa), or he can simply change the distribution of the assets of the portfolio between equities, bonds and cash. Indeed the portfolio manager may consciously switch his portfolio holdings between equities, bonds and cash in trying to outguess the movements of the market.
This consideration brings us to an important issue regarding the meaning of “forecasting ability.” A manager's forecasting ability may consist of an ability to forecast the price movements of individual securities and/or an ability to forecast the general behavior of security prices in the future (the “market factor” π in our model). Therefore we want an evaluation model which will incorporate and reflect the ability of the manager to forecast the market's behavior as well as his ability to choose individual issues.
Fortunately the model outlined above will also measure the success of these market forecasting or “timing” activities as long as we can assume that the portfolio manager attempts on average to maintain a given level of risk in his portfolio. More formally as long as we can express the risk of the j'th portfolio at any time t as
where βj is the “target” risk level which the portfolio manager wishes to maintain on average through time, and is a normally distributed random variable (at least partially under the manager's control) with . The variable is the vehicle through which the manager may attempt to capitalize on any expectations he may have regarding the behavior of the market factor in the next period. For example if the manager (correctly) perceives that there is a higher probability that π will be positive (rather than negative) next period, he will be able to increase the returns on his portfolio by increasing its risk,14 i.e., by making εjt positive this period. On the other hand he can reduce the losses (and therefore increase the average returns) on the portfolio by reducing the risk level of the portfolio (i.e., making εjt negative) when the market factor π is expected to be negative. Thus if the manager is able to forecast market movements to some extent, we should find a positive relationship between and . We can state this relationship formally as:
where the error term w̃jt is assumed to be normally distributed with E(w̃jt) = 0. The coefficient aj will be positive if the manager has any forecasting ability and zero if he has no forecasting ability. We can rule out aj < 0, since as a conscious policy this would be irrational. Moreover, we can rule out aj < 0 caused by perverse forecasting ability since this also implies knowledge of and would therefore be reflected in a positive aj as long as the manager learned from past experience. Note also that eq. (10) includes no constant term since by construction this would be included in βj in eq. (9). In addition we note that while aj will be positive only if the manager can forecast its size will depend on the manager's willingness to bet on his forecasts. His willingness to bet on his forecasts will of course depend on his attitudes towards taking these kinds of risks and the certainty with which he views his estimates.
Now as long as the estimated risk parameter is an unbiased estimate of the average risk level βj, the estimated performance measure () will also be unbiased. Under the assumption that the forecast error w̃jt is uncorrelated with πt (which is certainly reasonable), it can be shown15 that the expected value of the least squares estimator is:
Thus the estimate of the risk parameter is biased downward by an amount given by aj E(R̃M), where aj is the parameter given in eq. (10) (which describes the relationship between and ). By the arguments given earlier aj can never be negative and will be equal to zero when the manager possesses no market forecasting ability. This is important since it means that if the manager is unable to forecast general market movements we obtain an unbiased estimate of his ability to increase returns on the portfolio by choosing individual securities which are “undervalued.”
However, if the manager does have an ability to forecast market movements we have seen that aj will be positive and therefore as shown in eq. (12) the estimated risk parameter will be biased downward. This means, of course, that the estimated performance measure () will be biased upward (since the regression line must pass through the point of sample means).
Hence it seems clear that if the manager can forecast market movements at all we most certainly should see evidence of it since our techniques will tend to overstate the magnitude of the effects of this ability. That is, the performance measure, αj, will be positive for two reasons: (1) the extra returns actually earned on the portfolio due to the manager's ability, and (2) the positive bias in the estimate of αj resulting from the negative bias in our estimate of βj.
III. The Data and Empirical Results
The Data.—The sample consists of the returns on the portfolios of 115 open end mutual funds for which net asset and dividend information was available in Wiesenberger's Investment Companies for the ten-year period 1955–64.16 The funds are listed in Table 1 along with an identification number and code denoting the fund objectives (growth, income, etc.). Annual data were gathered for the period 1955–64 for all 115 funds and as many additional observations as possible were collected for these funds in the period 1945–54.
|141||0||Affiliated Fund, Inc.|
|142||2||American Business Shares, Inc.|
|144||3||American Mutual Fund, Inc.|
|145||4||Associated Fund Trust|
|146||0||Atomics, Physics + Science Fund, Inc.|
|147||2||Axe-Houghton Fund B, Inc.|
|1148||2||Axe-Houghton Fund A, Inc.|
|2148||0||Axe-Houghton Stock Fund, Inc.|
|150||3||Blue Ridge Mutual Fund, Inc.|
|151||2||Boston Fund, Inc.|
|152||4||Broad Street Investing Corp.|
|153||3||Bullock Fund, Ltd.|
|155||0||Canadian Fund, Inc.|
|157||0||Century Shares Trust|
|158||0||The Channing Growth Fund|
|1159||0||Channing Income Fund, Inc.|
|2159||3||Channing Balanced Fund|
|160||3||Channing Common Stock Fund|
|162||0||Chemical Fund, Inc.|
|163||4||The Colonial Fund, Inc.|
|164||0||Colonial Growth + Energy Shares, Inc.|
|165||2||Commonwealth Fund—Plan C|
|166||2||Commonwealth Investment Co.|
|167||3||Commonwealth Stock Fund|
|168||2||Composite Fund, Inc.|
|169||4||Corporate Leaders Trust Fund Certificates, Series “B”|
|171||3||Delaware Fund, Inc.|
|172||0||De Vegh Mutual Fund, Inc. (No Load)|
|173||0||Diversified Growth Stock Fund, Inc.|
|174||2||Diversified Investment Fund, Inc.|
|175||4||Dividend Shares, Inc.|
|176||0||Dreyfus Fund Inc.|
|177||2||Eaton + Howard Balanced Fund|
|178||3||Eaton + Howard Stock Fund|
|180||3||Equity Fund, Inc.|
|182||3||Fidelity Fund, Inc.|
|184||3||Financial Industrial Fund, Inc.|
|185||3||Founders Mutual Fund|
|1186||0||Franklin Custodian Funds, Inc.—Utilities Series|
|2186||0||Franklin Custodial Funds, Inc.—Common Stock Series|
|187||3||Fundamental Investors, Inc.|
|188||2||General Investors Trust|
|189||0||Growth Industry Shares, Inc.|
|190||4||Group Securities—Common Stock Fund|
|1191||0||Group Securities—Aerospace—Science Fund|
|2191||2||Group Securities—Fully Administered Fund|
|192||3||Guardian Mutual Fund, Inc. (No Load)|
|193||3||Hamilton Funds, Inc.|
|194||0||Imperial Capital Fund, Inc.|
|195||2||Income Foundation Fund, Inc.|
|197||1||Incorporated Income Fund|
|200||3||The Investment Company of America|
|201||2||The Investors Mutual, Inc.|
|202||3||Investors Stock Fund, Inc.|
|203||1||Investors Selective Fund, Inc.|
|205||3||Investment Trust of Boston|
|206||2||Istel Fund, Inc.|
|207||3||The Johnston Mutual Fund Inc. (No-Load)|
|208||3||Keystone High-Grade Common Stock Fund (S-1)|
|1209||4||Keystone Income Common Stock Fund (S-2)|
|2209||0||Keystone Growth Common Stock Fund (S-3)|
|210||0||Keystone Lower-Priced Common Stock Fund (S-4)|
|1211||1||Keystone Income Fund—(K-1)|
|2211||0||Keystone Growth Fund (K-2)|
|1212||1||The Keystone Bond Fund (B-3)|
|2212||1||The Keystone Bond Fund (B-4)|
|215||2||Loomis-Sayles Mutual Fund, Inc. (No Load)|
|216||0||Massachusetts Investors Growth Stock Fund, Inc.|
|217||3||Massachusetts Investors Trust|
|218||2||Massachusetts Life Fund|
|219||4||Mutual Investing Foundation, MIF Fund|
|220||2||Mutual Investment Fund, Inc.|
|221||0||National Investors Corporation|
|222||4||National Securities Stock Series|
|1223||0||National Securities—Growth Stock Series|
|2223||1||National Securities—Income Series|
|224||1||National Securities—Dividend Series|
|225||2||Nation-Wide Securities Company, Inc.|
|226||2||New England Fund|
|227||4||Northeast Investors Trust (No Load)|
|231||3||Philadelphia Fund, Inc.|
|232||4||Pine Street Fund, Inc. (No Load)|
|233||3||Pioneer Fund, Inc.|
|234||0||T. Rowe Price Growth Stock Fund, Inc. (No Load)|
|235||1||Puritan Fund, Inc.|
|236||2||The George Putnam Fund of Boston|
|239||2||Research Investing Corp.|
|240||2||Scudder, Stevens + Clark Balanced Fund, Inc. (No Load)|
|241||3||Scudder, Stevens + Clark Common Stock Fund, Inc. (No Load)|
|243||3||Selected American Shares, Inc.|
|244||2||Shareholders' Trust of Boston|
|245||3||State Street Investment Corporation (No Load)|
|246||2||Stein Roe + Farnham Balanced Fund, Inc. (No Load)|
|247||0||Stein Roe + Farnham International Fund, Inc. (No Load)|
|249||0||Television-Electronics Fund, Inc.|
|250||0||Texas Fund, Inc.|
|251||3||United Accumulative Fund|
|252||4||United Income Fund|
|253||0||United Science Fund|
|254||1||The Value Line Income Fund, Inc.|
|255||0||The Value Line Fund, Inc.|
|256||4||Washington Mutual Investors Fund, Inc.|
|257||2||Wellington Fund, Inc.|
|259||3||Wisconsin Fund, Inc.|
|260||2||Composite Bond and Stock Fund, Inc.|
|1261||3||Crown Western-Diversified Fund (D-2)|
|2261||2||Dodge + Cox Balanced Fund (No Load)|
|2262||2||Fiduciary Mutual Investing Company, Inc.|
|263||4||The Knickerbocker Fund|
|267||4||Southwestern Investors, Inc.|
|1268||2||Wall Street Investing Corporation|
|2268||2||Whitehall Fund, Inc.|
|1000||0||College Retirement Equities Fund|
For this earlier period, 10 years of complete data were obtained for 56 of the original 115 funds.
Definitions of the Variables.—The following are the exact definitions of the variables used in the estimation procedures:
S̃t = Level of the Standard and Poor Composite 500 price index17 at the end of year t.
D̃t = Estimate of dividends received on the market portfolio in year t as measured by annual observations on the four quarter moving average18 of the dividends paid by the companies in the composite 500 Index (stated on the same scale as the level of the S & P 500 Index).
The estimated annual continuously compounded rate of return on the market portfolio M for year t.
ÑAjt = Per share net asset value of the j'th fund at the end of year t.
ID̃jt = Per share “income” dividends paid by the j'th fund during year t.
C̃Gjt = Per share “Capital gains” distributions paid by the j'th fund during year t.
The annual continuously compounded rate of return on the j'th fund during year t. (Adjusted for splits and stock dividends.)19
rt = Yield to maturity of a one-year government bond at the beginning of year t (obtained from Treasury Bulletin yield curves).
RFt = loge(1 + rt) = Annual continuously compounded risk free rate of return for year t.
nj = The number of yearly observations of the j'th fund. 10 ≤ nj ≤ 20.
The Empirical Results.—Table 2 presents some summary statistics of the frequency distributions of the regression estimates of the parameters of eq. (8) for all 115 mutual funds using all sample data available for each fund in the period 1945–64. The table presents the mean, median, extreme values, and mean absolute deviation of the 115 estimates of α, β, r2, and ρ(ut, ut−1) (the first order autocorrelation of residuals). As can be seen in the table the average intercept was −.011 with a minimum value of −.078 and a maximum value of .058. We defer a detailed discussion of the implications of these estimated intercepts for a moment.
|Item||Mean Value||Median Value||Extreme Values||Mean Absolute Deviation*|
Since the average value of β was only .840, on average these funds tended to hold portfolios which were less risky than the market portfolio. Thus any attempt to compare the average returns on these funds to the returns on a market index without explicit adjustment for differential riskiness would be biased against the funds. The average squared correlation coefficient, r̂2, was .865 and indicates in general that eq. (8) fits the data for most of the funds quite closely. The average first order autocorrelation of residuals, −.077, is quite small as expected.
Our primary concern in this paper is the interpretation of the estimated intercepts. They are presented in Table 3 along with the fund identification number and the “t” values and sample sizes. The observations are ordered from lowest to highest on the basis of . The estimates range from −.0805 to +.0582. Table 4 and Figure 1–Figure 4 present summary frequency distributions of these estimates (along with the distributions of the coefficients estimated for several other time intervals which will be discussed below).
|Fund ID Number||Number of Observations|
|Class Interval||All Funds Entire Sample Period*||56 Funds 20 Years 1945–64 Gross Returns||All Funds 10 Years 1955–64 Gross Returns|
|Net Returns||Gross Returns|
|.06 ≤ < .07||0||1||0||0|
|.05 ≤ < .06||1||0||0||1|
|.04 ≤ < .05||0||0||0||0|
|.03 ≤ < .04||0||1||1||1|
|.02 ≤ < .03||3||9||2||12|
|.01 ≤ < .02||12||16||8||15|
|.0 ≤ < .01||23||21||13||31|
|−.01 < < .0||22||29||17||12|
|−.02 < ≤ −.01||21||14||6||13|
|−.03 < ≤ −.02||12||11||5||12|
|−.04 < ≤ −.03||9||9||2||3|
|−.05 < ≤ −.04||8||1||1||1|
|.06 < ≤ −.05||1||1||1||1|
|.07 < ≤ −.06||1||0||0||0|
|−.08 < ≤ −.07||1||2||0||0|
|−.09 < ≤ −.08||1||0||0||1|
In order to obtain additional information about the forecasting success of fund managers eq. (8) was also estimated using fund returns calculated before deduction of fund expenses as well as after. Fund loading charges were ignored in all cases.20 Columns 1 and 2 of Table 4 and Figure 1 and Figure 2 present the frequency distributions of the estimated α's obtained by using all sample data available for each fund. The number of observations in the estimating equation varies from 10 to 20 and the time periods are obviously not all identical. Column 1 and Figure 1 present the frequency distribution of the 115 intercepts estimated on the basis of fund returns calculated net of all expenses. Column 2 of Table 4 and Figure 2 present the frequency distributions of the estimates obtained from the fund returns calculated before deductions of management expenses (as given by Wiesenberger 21).
The average value of calculated net of expenses was −.011 which indicates that on average the funds earned about 1.1% less per year (compounded continuously) than they should have earned given their level of systematic risk. It is also clear from Figure 1 that the distribution is skewed to the low side with 76 funds having and only 39 with .
The model implies that with a random selection buy and hold policy one should expect on average to do no worse than α = 0. Thus it appears from the preponderance of negative that the funds are not able to forecast future security prices well enough to recover their research expenses, management fees and commission expenses.
In order to examine this point somewhat more closely the α's were also estimated on the basis of returns calculated gross of all management expenses.22 That is R̃jt was taken to be
where Ejt is the estimated per share dollar value of all expenses except brokerage commissions, interest and taxes (the latter two of which are small) for the j'th fund in year t obtained from . Now when the estimates are based on gross returns any forecasting success of the funds (even if not sufficient to cover their expenses) should be revealed by positive .
The results shown in Column 2 of Table 4 indicate the average estimated from gross return data was −.004 or −.4% per year, with 67 funds for which < 0 and 48 for which > 0. The frequency distribution, plotted in Figure 2, is much more symmetric than the distribution obtained from the net returns. Thus it appears that on average during this 20-year period the funds were not able to increase returns enough by their trading activities to recoup even their brokerage commissions (the only expenses which were not added back to the fund returns).
In order to avoid the difficulties associated with non-identical time periods and unequal sample sizes, the measures for the 56 funds for which data were available for the entire 20-year period are summarized in Column 3 of Table 4 and Figure 3. The results indicate an average of −.032 with 32 funds for which and 24 funds for which . It is very likely that part of this apparently poorer gross performance is due to the method used in approximating the expenses for the years prior to 1955. It was noted earlier that the expenses for these earlier years were assumed to be equal to the expenses for 1955. But since these expense ratios were declining in the earlier period these estimates are undoubtedly too low.
Finally in order to avoid any difficulty associated with the estimates of the expenses before 1955, the measures were estimated for each of the 115 funds using only the gross return data in the 10-year period 1955–64. The frequency distribution of the is given in Column 4 of Table 4 and Figure 4. The average for this period was −.001 or −.1% per year with 55 funds for which and 60 funds for which . The reader must be careful about placing too much significance on the seemingly larger number of funds with . It is well known that measurement errors (even though unbiased) in any independent variable will cause the estimated regression coefficient of that variable to be attenuated towards zero (cf. [12, chap. 6]). Since we know that there are undoubtedly some errors in the measurement of both the riskless rate and the estimated returns on the market portfolio, the coefficients are undoubtedly slightly downward biased. This of course results in an upward bias in the estimates of the αj since the least squares regression line must pass through the point of means.
There is one additional item which tends to bias the results slightly against the funds. That is, the model implicitly assumes the portfolio is fully invested. But since the mutual funds face stochastic inflows and outflows they must maintain a cash balance to meet them. Data presented in [8, pp. 120–127] indicate that on average the funds appear to hold about 2% of their total net assets in cash. If we assume the funds had earned the riskless rate on these assets (about 3% per year) this would increase their returns (and the average ) by about (.02) (.03) = .0006 per year. Thus the adjusted average α is about −.0004, and it is now getting very difficult to say that this is really different from zero. Thus, let us now give explicit consideration to these questions of “significance.”
The “Significance” of the Estimates.—We now address ourselves to the question regarding the statistical significance of the estimated performance measures. Table 3 presents a listing of the “t” values for the individual funds, the intercepts, and the number of observations used in obtaining each estimate. We noted earlier that it is possible for a fund manager to do worse than a random selection policy since it is easy to lower a fund's returns by unwisely spending resources in unsuccessful attempts to forecast security prices. The fact that the shown in Table 3 and Figure 1 are skewed to the left indicate this may well be true. Likewise an examination of the “t” values given in Table 3 and plotted in Figure 5 indicates that the t values for 14 of the funds were less than −2 and hence are all significantly negative at the 5% level.23 However, since we had little doubt that it was easy to do worse than a random policy we are really interested mainly in testing the significance of the large positive performance measures.
An examination of Column 3 of Table 3 reveals only 3 funds which have performance measures which are significantly positive at the 5% level. But before concluding that these funds are superior we should remember that even if all 115 of these funds had a true α equal to zero, we would expect (merely because of random chance) to find 5% of them or about 5 or 6 funds yielding t values “significant” at the 5% level. Thus, henceforth we shall concentrate on an examination of the entire frequency distribution of the estimated t values to see whether we observe more than the expected number of significant values. Unfortunately because of the differing degrees of freedom among the observations plotted in Figure 5 and Figure 6 (which contains the gross estimates), the frequency distributions are somewhat difficult to interpret.
However Figure 7 presents the frequency distribution of the t values calculated on the basis of gross returns for the 56 funds for which 20 complete years of data were available. The t value for the one-tail 2.5% level of significance is 2.1, and thus we expect (.025) (56) = 1.4 observations with t values greater than 2.1. We observe just one. Again we also observe a definite skewness towards the negative values and no evidence of an ability to forecast security prices. It is interesting to note that if the model is valid and if we have indeed returned all expenses to the funds, these distributions should be symmetric about zero. However, we have not added back any of the brokerage commissions and have used estimates of the expenses for the years 1945–54 which we strongly suspect are biased low. Thus the results shown in Figure 7 are not too surprising.
As mentioned above, in order to avoid some of these difficulties and to test more precisely whether or not the funds were on average able to forecast well enough to cover their brokerage expenses (even if not their other expenses) the performance measures were estimated just for the period 1955–64. The frequency distribution for the t values of the intercepts of the 115 funds estimated from gross returns is given in Figure 8 and column 4 of Table 5. All the observations have 8 degrees of freedom, and the maximum and minimum values are respectively +2.17 and −2.84. It seems clear from the symmetry of this distribution about zero and especially from the lack of any values greater than +2.2 that there is very little evidence that any of these 115 mutual funds in this 10-year period possessed substantial forecasting ability. We refrain from making a strict formal interpretation of the statistical significance of these numbers and warn the reader to do likewise since there is a substantial amount of evidence (cf. [4, 18]) which indicates the normality assumptions on the residuals, ũjt, of (8) may not be valid. We also point out that one could also perform chi-square goodness of fit tests on the t distributions presented, but for the same reasons mentioned above we refrain from doing so. That is, if the residuals are not normally distributed the estimates of the parameters will not be distributed according to the student t distribution, and therefore it doesn't really make sense to make formal goodness of fit tests against the “t” distribution.
|Class Interval||All Funds Entire Sample Period**||56 Funds 20 Years 1945–64 Gross Returns||All Funds 10 Years 1955–64 Gross Returns|
|Net Returns||Gross Returns|
|4 ≤ t() < 5||0||0||0||0|
|3 ≤ t() < 4||0||0||0||0|
|2 ≤ t() < 3||1||5||1||2|
|1 ≤ t() < 2||10||15||7||21|
|0 ≤ t() 1||28||28||15||37|
|−1 < t() < 0||32||41||20||36|
|−2 < t() ≤ −1||30||21||8||15|
|−3 < t() ≤ −2||10||2||2||4|
|−4 < t() ≤ −3||1||2||2||0|
|− 5 < t() ≤ −4||3||1||1||0|
However, while the possible non-normality of these disturbances causes problems in attempting to perform the usual types of significance tests, it should be emphasized that the model itself is in no way crucially dependent on this assumption. Wise  has shown that the least squares estimates of bj in (2) are unbiased and consistent if the disturbance terms uj conform to the symmetric and finite mean members of the stable class of distributions. Furthermore, Fama  has demonstrated that the capital asset pricing model results (eq. (1)) can still be obtained in the context of these distributions. A complete discussion of the issues associated with this distributional problem and their relationship to the portfolio evaluation problem is available in  and will not be repeated here. It is sufficient to reiterate the fact that the normality assumption is necessary only in order to perform the strict tests of significance, and we warn the reader to interpret these tests as merely suggestive until the state of stable distribution theory is developed to the point where strict tests of significance can be legitimately performed.
It is important to note in examining the empirical results presented above that the mutual fund industry (as represented by these 115 funds) shows very little evidence of an ability to forecast security prices. Furthermore there is surprisingly little evidence that indicates any individual funds in the sample might be able to forecast prices. These results are even stronger when one realizes that the biases in the estimates24 all tend to either exaggerate the magnitude of any forecasting ability which might exist25 or tend to show evidence of forecasting ability where none exists.
The evidence on mutual fund performance discussed above indicates not only that these 115 mutual funds were on average not able to predict security prices well enough to outperform a buy-the-market-and-hold policy, but also that there is very little evidence that any individual fund was able to do significantly better than that which we expected from mere random chance. It is also important to note that these conclusions hold even when we measure the fund returns gross of management expenses (that is assume their bookkeeping, research, and other expenses except brokerage commissions were obtained free). Thus on average the funds apparently were not quite successful enough in their trading activities to recoup even their brokerage expenses.
It is also important to remember that we have not considered in this paper the question of diversification. Evidence reported elsewhere (cf. Jensen ) indicates the funds on average have done an excellent job of minimizing the “insurable” risk born by their shareholders. Thus the results reported here should not be construed as indicating the mutual funds are not providing a socially desirable service to investors; that question has not been addressed here. The evidence does indicate, however, a pressing need on the part of the funds themselves to evaluate much more closely both the costs and the benefits of their research and trading activities in order to provide investors with maximum possible returns for the level of risk undertaken.
Assuming, of course, that investors' expectations are on average correct.
It is also interesting to note that the measure of performance suggested below is in many respects quite closely related to the measure suggested by Treynor .
Defined as the ratio of capital gains plus dividends to the initial price of the security. (Note, henceforth we shall use the terms asset and security interchangeably.)
Note that since σ2(R̃M) is constant for all securities the risk of any security is just cov(R̃j, R̃M). But since cov(R̃M, R̃M) = σ2(R̃M) the risk of the market portfolio is just σ2(R̃M), and thus we are really measuring the riskiness of any security relative to the risk of the market portfolio. Hence the systematic risk of the market portfolio, cov(R̃M, R̃M)/σ2(R̃M), is unity, and thus the dimension of the measure of systematic risk has a convenient intuitive interpretation.
The “market model” given in eqs. (2) and (3a)−(3d) is in spirit identical to the “diagonal model” analyzed in considerable detail by Sharpe [19, 22] and empirically tested by Blume . The somewhat more descriptive term “market model” was suggested by Fama . The “diagonal model” is usually stated as
where Ĩ is some index of market returns, ũj is a random variable uncorrelated with Ĩ, and aj and bj are constants. The differences in specification between (2) and (2a) are necessary in order to avoid the overspecification (pointed out by Fama ) which arises if one chooses to interpret the market index I as an average of security returns or as the returns on the market portfolio, M (cf., [15, 20]). That is, if I is some average of security returns then the assumption that ũj is uncorrelated with Ĩ (equivalent to (3c)) cannot hold since Ĩ contains ũj.
The return on the market portfolio is given by where Xj is the ratio of the total value of the j'th asset to the total value of all assets. Thus by substitution from (2) we have
Note that the first term on the right hand side of (3) is just E(R̃Mt), and since the market factor π is unique only up to a transformation of scale (cf. ) we can scale π such that and the second term becomes just π. Furthermore by assumption, the ẽjt in the third term are independently distributed random variables with E(ẽjt) = 0, and empirical evidence indicates that the σ2(ẽj) are roughly of the same order of magnitude as (cf. [1, 13]). Hence the variance of the last term on the right hand side of (3), given by
will be extremely small since on average Xj will be equal to 1/N, and N is very large. But since the expected value of this term is zero, and since we have shown its variance is extremely small, it is unlikely that it will be very different from zero at any given time. Thus to a very close approximation the returns on the market portfolio will be given by eq. (4).
Note that the parameters βj (in (1a)) and bj (in (2)) are not subscripted by t and are thus assumed to be stationary through time. Jensen  has shown (2) to be an empirically valid description of the behavior of the returns on the portfolios of 115 mutual funds, and Blume  has found similar results for the behavior of the returns on individual securities.
In addition it will be shown below that any non-stationarity which might arise from attempts to increase returns by changing the riskiness of the portfolio according to forecasts about the market factor π lead to relatively few problems.
Evidence given in  suggests this is true.
In the statistical sense of the term.
If ũjt were not serially independent the manager could increase his return even more by taking account of the information contained in the serial dependence and would therefore eliminate it.
See note 8 above.
Perhaps by shifting resources out of bonds and into equities, or if no bonds are currently held, by shifting into higher risk equities or by borrowing funds and investing them in equities.
The data were obtained primarily from the 1955 and 1965 editions of Wiesenberger , but some data not available in these editions were taken from the 1949–54 editions. Data on the College Retirement Equities Fund (not listed in Wiesenberger) were obtained directly from annual reports.
All per share data were adjusted for stock splits and stock dividends to represent an equivalent share as of the end of December 1964.
Obtained from . Prior to March 1, 1957, the S & P index was based on only 90 securities (50 industrials, 20 rails and 20 utilities) and hence for the earlier period the index is a poorer estimate of the returns on the market portfolio.
Obtained from . Since the use of this moving average introduces measurement errors in the index returns it would be preferable to use an index of the actual dividends, but such an index is not available.
Note that while most funds pay dividends on a quarterly basis we treat all dividends as though they were paid as of December 31 only. This assumption of course will cause the measured returns on the fund portfolios on average to be below what they would be if dividends were considered to be reinvested when received, but the data needed to accomplish this are not easily available. However, the resulting bias should be quite small. In addition, the same bias is incorporated into the measured returns on the market portfolio.
The loading charges have been ignored since our main interest here is not to evaluate the funds from the standpoint of the individual investor but only to evaluate the fund managers' forecasting ability.
Actual expense data were available only for the 10 years 1955–64. Therefore in estimating gross returns for the years 1945–54 the expense ratio for 1955 was added (before adjustment to a continuous base) to the returns for these earlier years.
It would be desirable to use the fund returns gross of all expenses including brokerage commissions as well as the management expenses. However, overall commission data are not yet available.
The t value for 5% level of significance (one-tail) with 8 degrees of freedom (the minimum in the sample) is 1.86 and for 18 degrees of freedom (the maximum in the sample) is 1.73.
Except for the assumption of a fully invested portfolio which we have allowed for by assuming cash earned interest at the riskless rate.
See Section II.
- 1“The Assessment of Portfolio Performance”, (unpublished Ph.D. dissertation, University of Chicago, 1968).
- 2“An Empirical Evaluation of Alternative Portfolio Selection Models”, Journal of Business, XXXX (April, 1967), 166–193.and
- 3Pension Funds: Measuring Investment Performance. New York: The Free Press, 1966.
- 4“The Behavior of Stock-Market Prices”, Journal of Business, XXXVII (January, 1965), 34–105.
- 5“Risk, Return and Equilibrium: Some Clarifying Comments”, Journal of Finance, XXIII (March, 1968), 29–40.
- 6“Risk, Return, and General Equilibrium in a Stable Paretian Market”, (unpublished manuscript, University of Chicago, 1967).
- 7The Investment Decision Under Uncertainty. Englewood Cliffs, N.J.: Prentice Hall, Inc., 1962.
- 8A Study of Mutual Funds. Washington, D.C.: U.S. Government Printing Office, 1962., , and
- 9“Portfolio Selection and Investment Performance”, Journal of Finance, XX (September, 1965), 391–415., , and
- 10“A Model for Mutual Fund Evaluation”, Industrial Management Review, VI (Spring, 1965), 81–92.
- 11“Risk, the Pricing of Capital Assets, and the Evaluation of Investment Portfolios”, (unpublished preliminary draft of PhD. Thesis, University of Chicago, July, 1967).
- 12Econometric Methods. New York: McGraw Hill, Inc., 1963..
- 13“Market and Industry Factors in Stock Price Behavior”, Journal of Business, XXXIX, Part II (January, 1966), 139–190.
- 14“The Valuation of Risk Assets and the Selection of Risky Investments in Stock Portfolios and Capital Budgets”, Review of Economics and Statistics, XLVII (February, 1965), 13–37.
- 15“Security Prices, Risk, and Maximal Gains from Diversification”, Journal of Finance, XX (December, 1965), 587–616.
- 16“The Variation of Certain Speculative Prices”, Journal of Business, XXXVI (October, 1963), 394–419.
- 17Portfolio Selection: Efficient Diversification of Investments. Cowles Foundation Monograph No. 16. New York: John Wiley & Sons, Inc., 1959..
- 18“The Efficient Market Model Applied to U.S. Treasury Bills”, (unpublished PhD. dissertation, University of Chicago, 1968).
- 19“A Simplified Model for Portfolio Analysis”, Management Science (January, 1963), 277–293.
- 20“Capital Asset Prices: A Theory of Market Equilibrium Under Conditions of Risk”, Journal of Finance, XIX (September, 1964), 425–442.
- 21“Mutual Fund Performance”, Journal of Business XXXIX, Part 2 (January, 1966), 119–138.
- 22“Linear Programming Algorithm for Mutual Fund Portfolio Selection”, Management Science, XIII (March, 1967), pp. 499–510.
- 23Standard and Poor's. Trade and Securities Statistics: Security Price Index Record. (Orange, Conn.: Standard and Poor's Corporation, 1964).
- 24“How to Rate Management of Investment Funds”, Harvard Business Review, XLIII (JanuaryFebruary, 1965), 63–75.
- 25“Toward a Theory of Market Value of Risky Assets”, (unpublished manuscript, undated).
- 26Investment Companies. New York: Arthur Wiesenberger & Company, 1955 and 1965.
- 27“Linear Estimators for Linear Regression Systems Having Infinite Variances”, (unpublished paper presented at the Berkeley-Stanford Mathematical Economics Seminar, October, 1963).