Rule 2: A Small Number of Rules
Thursday, 23 April 2009

Consider two well known trend following systems
The common dual moving average system has just two rules. One says to buy the up¬side crossover, and the other says to sell the downside crossover. Similarly, the popular 20-bar breakout system has at least four rules, two each for entries and exits. You can show with testing software that these systems are profitable over many markets across multi year time frames.
You can contrast this approach with an expert system based trading system that may have hundreds of rules. For example, one commercially available system apparently has more than 400 rules. However, it turns out that only one rule is the actual trigger for the trades. The deterministic systems differ from neural-net-based systems that may have an unknown number of rules.
The statistical theory of design of experiments says that even complex processes are controllable using five to seven "main" variables. It is rare for a process to depend on more than ten main variables, and it is quite difficult to reliably control a process that depends on 20 or more variables. It is also rare to find processes that depend on the interactions of four or more variables. Thus, the effect of higher-order interactions is usually insignificant. The goal is to keep the overall number of rules and variables as small as possible.
There are many hazards in designing trading systems with a large number of rules. First, the relative importance of rules decreases as the number of rules increases. Second, the degrees of freedom decrease as the number of rules or variables increases. This means larger amounts of test data are needed to get valid results as the number of rules or variables increases.
A third problem is the danger of curve-fitting the data in the test sample. For example, given a data set, a simple linear regression with just
Principles of Trading System Design
two variables may fit the data adequately. As the number of variables in the regression increases to, say, seven, the line fits the data more closely. Therefore, we can pick up nuances in the data when we curve-fit our trading system, only to pick up patterns that may never repeat in the future. The total degrees of freedom decrease by two for the simple linear regression, but will decrease by seven for the polynomial regression.