PORTFOLIO THEORY โ€“ LECTURE NOTES 2 Dr. Andrea Rigamonti FACTOR INVESTING WITH LONG-SHORT PORTFOLIOS Computing optimal weights is not the only possibility. An alternative approach involves creating long-short portfolios. Suppose we want to invest into N assets. First, assets are ranked according to their predicted return. We then assemble a portfolio with two legs: a long leg which contains a given number of assets with the highest predicted returns, and a short leg with a given number of assets predicted to have the lowest returns. Within a certain leg the assets are often equally weighted, although other weighting systems that assign different weights based on the predicted return are of course possible. One of the advantages of such an approach is that it allows the investor to consider predicted returns without suffering the full consequences of the estimation error in the mean. Optimization procedures like the mean-variance one are in fact error-maximizing: errors in the inputs lead to extreme weights, which can lead to abysmal performance. This is why standard mean-variance optimization rarely works well and is generally replaced either with more advanced techniques that limit extreme allocations, or with a minimum variance portfolio. A long-short portfolio does not have theoretically optimal weights, but it can still work better by avoiding this error-maximization trap. The other advantage is that a long-short portfolio can be self-financing: the money obtained from shorting the assets predicted to perform poorly is used to go long on the assets with predicted high return. As short positions tend to be more risky than long positions, using a partially self-financing portfolio is also common. In this case, less than half (e.g., 30%) of the wealth is placed on the short leg, and the long leg is financed partly from the shorting and partly from the investorโ€™s initial wealth (in our example where 70% of the invested sum goes to the long leg, 30% of the money comes for the shorting and the other 40% from the investorโ€™s funds). A practical disadvantage of such a portfolio is that in the real world it is generally difficult to short a large number of stocks. So in practice N has to be relatively small, and therefore it can be more risky, as it is not very well diversified. A long-short portfolio can also be particularly vulnerable in turbulent market conditions, when the price of virtually all stocks are either increasing or decreasing at the same time. The latter problem can be eased by making the portfolio construction more flexible (e.g., by varying the number of stocks and/or the amount of wealth in the long and short leg depending on the market conditions). Obviously, a pre-condition for creating a long-short portfolio is having a ranking based on how we expect the assets to perform. How can we obtain it? Using the sample means is not appropriate, as we pointed out that such estimates are too unreliable. A much better alternative is to use a factorbased approach. In fact, long-short portfolios are a typical way factor investing is performed. This generally involves computing expected returns using multifactor models. Remember that the formula of a multifactor model with ๐‘˜ factors is: ๐‘…๐‘– = ๐›ผ๐‘– + ๐‘๐‘–1 ๐‘“1 + ๐‘๐‘–2 ๐‘“2 + โ‹ฏ + ๐‘๐‘–๐‘˜ ๐‘“๐‘˜ + ๐œ€๐‘– In practice, the expected return of a stock given a certain multifactor model is computed as: ๐ธ[๐‘…๐‘–] = ๐›ผ๐‘– + ๐‘๐‘–1 ๐›พ1 + ๐‘๐‘–2 ๐›พ2 + โ‹ฏ + ๐‘๐‘–๐‘˜ ๐‘“๐›พ ๐‘˜ where ๐›พ is the factor risk premium.1 Therefore, we need to estimate the loadings ๐‘๐‘–๐‘˜ and the risk premia. This is typically done using the Fama-MacBeth regression. It is a two-stage linear regression. Consider an estimation sample with ๐‘ assets and ๐‘‡ periods. In the first stage, the loadings are estimated by regressing the returns of each asset ๐‘– on the ๐‘˜ factors, using the entire set of ๐‘‡ periods: ๐‘…1๐‘ก = ๐›ผ1 + ๐‘11 ๐‘“1 ๐‘ก + ๐‘12 ๐‘“2๐‘ก + โ‹ฏ + ๐‘1๐‘˜ ๐‘“๐‘˜ ๐‘…2๐‘ก = ๐›ผ2 + ๐‘21 ๐‘“1๐‘ก + ๐‘22 ๐‘“2๐‘ก + โ‹ฏ + ๐‘2๐‘˜ ๐‘“๐‘˜ โ‹ฎ ๐‘…๐‘–๐‘ก = ๐›ผ๐‘– + ๐‘๐‘–1 ๐‘“1 ๐‘ก + ๐‘๐‘–2 ๐‘“2๐‘ก + โ‹ฏ + ๐‘๐‘–๐‘˜ ๐‘“๐‘˜ โ‹ฎ ๐‘… ๐‘๐‘ก = ๐›ผ ๐‘ + ๐‘ ๐‘1 ๐‘“1 ๐‘ก + ๐‘ ๐‘2 ๐‘“2๐‘ก + โ‹ฏ + ๐‘ ๐‘๐‘˜ ๐‘“๐‘˜๐‘ก The estimated loadings are then used as explanatory variables in a second regression that, for each period ๐‘ก, regresses the asset returns of the entire set of ๐‘ assets: ๐‘…๐‘–1 = ๐›พ10 + ๐›พ11 ๐‘๐‘–1 ฬ‚ + ๐›พ12 ๐‘๐‘–2 ฬ‚ + โ‹ฏ + ๐›พ1๐‘˜ ๐‘๐‘–๐‘˜ ฬ‚ ๐‘…๐‘–2 = ๐›พ20 + ๐›พ21 ๐‘๐‘–1 ฬ‚ + ๐›พ22 ๐‘๐‘–2 ฬ‚ + โ‹ฏ + ๐›พ2๐‘˜ ๐‘๐‘–๐‘˜ ฬ‚ โ‹ฎ ๐‘…๐‘–๐‘ก = ๐›พ๐‘ก0 + ๐›พ๐‘ก1 ๐‘๐‘–1 ฬ‚ + ๐›พ๐‘ก2 ๐‘๐‘–2 ฬ‚ + โ‹ฏ + ๐›พ๐‘ก๐‘˜ ๐‘๐‘–๐‘˜ ฬ‚ โ‹ฎ ๐‘…๐‘–๐‘‡ = ๐›พ ๐‘‡0 + ๐›พ ๐‘‡1 ๐‘๐‘–1 ฬ‚ + ๐›พ ๐‘‡2 ๐‘๐‘–2 ฬ‚ + โ‹ฏ + ๐›พ ๐‘‡๐‘˜ ๐‘๐‘–๐‘˜ ฬ‚ Ideally we should use the true loadings, but their value is of course unknown in practice. To compute the expected returns of each asset ๐‘– we need the loadings, estimated in the first regression, and the risk premia, estimated in the second regression. Notice however that the risk premia are time-varying. A common approach is to compute their average value over the ๐‘‡ periods (just like it is common to compute the average market excess return when using the CAPM). The expected return of asset ๐‘– according to the chosen multifactor model is given by (we omit the ^ to keep the notation light): ๐ธ[๐‘…๐‘–] = ๐‘๐‘–1 ๐›พ1 + ๐‘๐‘–2 ๐›พ2 + โ‹ฏ + ๐‘๐‘–๐‘˜ ๐›พ ๐‘˜ For greater clarity, let us consider how this works with the Fama-French three-factor model, which is probably the most important factor model. Recall that the model is: ๐‘…๐‘– = ๐‘…๐‘“ + ๐‘๐‘–1(๐‘… ๐‘š โˆ’ ๐‘…๐‘“) + ๐‘๐‘–2 ๐‘†๐‘€๐ต + ๐‘๐‘–3 ๐ป๐‘€๐ฟ In practice the expected return of asset ๐‘– will be computed as: ๐ธ[๐‘…๐‘–] = ๐‘…๐‘“ + ๐‘๐‘–1 ๐›พ(๐‘… ๐‘šโˆ’๐‘… ๐‘“) + ๐‘๐‘–2 ๐›พ๐‘†๐‘€๐ต + ๐‘๐‘–3 ๐›พ ๐ป๐‘€๐ฟ We use the Fama-MacBeth regression to estimate the loadings and the risk premia. Usually, the excess return is used as dependent variable, to focus on the component of the return that is dependent on factor exposure. Therefore, the first stage regression for each asset ๐‘– is: ๐‘…๐‘–๐‘ก โˆ’ ๐‘…๐‘“๐‘ก = ๐›ผ๐‘– + ๐‘๐‘–1(๐‘… ๐‘š๐‘ก โˆ’ ๐‘…๐‘“๐‘ก) + ๐‘๐‘–2 ๐‘†๐‘€๐ต๐‘ก + ๐‘๐‘–3 ๐ป๐‘€๐ฟ ๐‘ก 1 In the CAPM, and in single factor models in general, we can directly use the factor value (the excess market return in the case of CAPM). In multifactor models we cannot do this, and we need to use the risk premia of the factors instead. To simplify the notation, we indicate the first factor as ๐‘€๐พ๐‘‡: ๐‘…๐‘–๐‘ก โˆ’ ๐‘…๐‘“๐‘ก = ๐›ผ๐‘– + ๐‘๐‘–1 ๐‘€๐พ๐‘‡๐‘ก + ๐‘๐‘–2 ๐‘†๐‘€๐ต๐‘ก + ๐‘๐‘–3 ๐ป๐‘€๐ฟ ๐‘ก As explained before, this regression needs to be carried out separately for each of the ๐‘ assets. Now that we have the estimates for the loadings, we can set up the second stage regression: ๐‘…๐‘– โˆ’ ๐‘…๐‘“ = ๐›พ๐‘ก0 + ๐›พ๐‘ก1 ๐‘๐‘–1 ฬ‚ + ๐›พ๐‘ก2 ๐‘๐‘–2 ฬ‚ + ๐›พ๐‘ก3 ๐‘๐‘–3 ฬ‚ This regression needs to be carried out separately for each of the ๐‘‡ periods in the estimation window, obtaining ๐‘‡ values for ๐›พ๐‘ก1, ๐›พ๐‘ก2 and ๐›พ๐‘ก3 . We then compute their average in order to have a single value. We rename the average of ๐›พ๐‘ก1, ๐›พ๐‘ก2 and ๐›พ๐‘ก3 as ๐›พ ๐‘€๐พ๐‘‡, ๐›พ๐‘†๐‘€๐ต and ๐›พ ๐ป๐‘€๐ฟ respectively, for better clarity. We also compute the average risk-free rate in order to have a single value for ๐‘…๐‘“.2 We can now compute the expected return of each asset ๐‘– as: ๐ธ[๐‘…๐‘–] = ๐‘…๐‘“ + ๐‘๐‘–1 ๐›พ ๐‘€๐พ๐‘‡ + ๐‘๐‘–2 ๐›พ๐‘†๐‘€๐ต + ๐‘๐‘–3 ๐›พ ๐ป๐‘€๐ฟ It is now straightforward to create the long-short portfolio. We simply rank the assets according to their expected return, and take a long position on those positioned in the upper part of the ranking, and a short position on those in lower part of the ranking. Each time the portfolio has to be updated, we need to compute new estimates of the expected returns. So, for example, if we want to update the portfolio monthly, we ne need to repeat the procedure every month, using the up-to-date data. 2 Computing the average value of the risk premia and of the risk-free rate is a reasonable approach, and the one commonly used. However, it is not โ€œtheโ€ right approach. Other approaches might also be appropriate depending on the specific situation.