November 12th, 2020
CIAT-Beans EA
Reducing cycle time to increase rates of gain is not being fully exploited in classical programs neither the use of genomic prediction to boost even more the reduction of cycle time. Playing factors are not fully understood.
Crossing, Evaluation, Selection
L
\(\Delta_g = (i * \sigma_g * r)/L\)
Using genomic prediction in the recycling (selection) process to reduce the cycle time could increase the rate of genetic gain.
Treatment | Description |
---|---|
TPn_PPn_SPf4f5 | TrainingPop=NULL,PredictedPop=NULL,RecyclingPop=F4-F5 using an index. |
TPn_PPn_SPf5f6 | TrainingPop=NULL,PredictedPop=NULL,RecyclingPop=F5-F6 using an index. |
TPn_PPn_SPf6f7 | TrainingPop=NULL,PredictedPop=NULL,RecyclingPop=F6-F7 using an index. |
TPn_PPn_SPf7f8 | TrainingPop=NULL,PredictedPop=NULL,RecyclingPop=F7-F8 using an index. |
TPn_PPn_SPf7f8_NF | TrainingPop=NULL,PredictedPop=NULL,RecyclingPop=F7-F8 using an index with no family selection. |
TPf5f6_PPf1_SPf1 | TrainingPop=F5-F6, PredictedPop=F1, RecyclingPop=F1 using an index. |
A 20 year burn-in period was used. Burn-in was followed by a 20 year evaluation period to measure rates of genetic gain in F9 lines. Genotype-by-year, genotype-by-location interaction variances were assumed to be equivalent to main genetic variance. 25 replications done. We simulated 5 complex and 3 simple traits to be behind the genetic merit. TP=random, N.TP=3K, N.Markers=5K.
The plateu reached by GP can be attributed to quick genetic variance depletion and the drift in simple traits.
Treatment | Description |
---|---|
TPf5f6_PPf1_SPf1 | TrainingPop=F5-F6,PredictedPop=F1,RecyclingPop=F1 using an index. |
TPf5f6_PPf1_SPf1_OCS | TrainingPop=F5-F6,PredictedPop=F1,RecyclingPop=F1 using an index + optimal contribution. |
TPf5f6_PPf1_SPf1_2S | TrainingPop=F5-F6,PredictedPop=F1,RecyclingPop=F1 using an index + 2-step selection (simple traits => complex traits). |
TPf5f6_PPf1_SPf1_NN | TrainingPop=F5-F6,PredictedPop=F1,RecyclingPop=F1 using an index + increased #parents. |
TPn_PPn_SPf7f8_NF | TrainingPop=NULL,PredictedPop=NULL,RecyclingPop=F7-F8 using an index with no family selection. |
A 20 year burn-in period was used. Burn-in was followed by a 20 year evaluation period to measure rates of genetic gain in F9 lines. Genotype-by-year, genotype-by-location interaction variances were assumed to be equivalent to main genetic variance. 25 replications done. We simulated 5 complex and 3 simple traits to be behind the genetic merit. TP=random, N.TP=3K, N.Markers=5K.
OCS methodologies help to avoid a quick depletion of genetic variance. Increasing the number of parents to face the reduction of Ne (because of less cohorts) can also help. A different treatment of simple and complex traits in the application of GP makes a big difference.
Treatment | Description |
---|---|
TPf5f6_PPf1_SPf1_OCS | TrainingPop=F5-F6, PredictedPop=F1, RecyclingPop=F1 using an index+OCS. |
TPf6f7_PPf1_SPf1_OCS | TrainingPop=F6-F7, PredictedPop=F1, RecyclingPop=F1 using an index+OCS. |
TPf7f8_PPf1_SPf1_OCS | TrainingPop=F7-F8, PredictedPop=F1, RecyclingPop=F1 using an index+OCS. |
A 20 year burn-in period was used. Burn-in was followed by a 20 year evaluation period to measure rates of genetic gain in F9 lines. Genotype-by-year, genotype-by-location interaction variances were assumed to be equivalent to main genetic variance. 25 replications done. We simulated 5 complex and 3 simple traits to be behind the genetic merit. TP=random, N.TP=3K, N.Markers=5K.
Training the model with STG1 and STG2 data generated earlier or later doesn’t seem to have an important effect in the long term but in early years could be more important. The earlier the data feeds the model the best. Closing the gap between the TP and PP.
Treatment | Description |
---|---|
TPf5f6_PPf1_SPf1_OCS_FIRST | TrainingPop=F5-F6 using 3K old individuals, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
TPf5f6_PPf1_SPf1_OCS_LAST | TrainingPop=F5-F6 using 3K recent individuals, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
TPf5f6_PPf1_SPf1_OCS_RANDOM | TrainingPop=F5-F6 using 3K random individuals, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
A 20 year burn-in period was used. Burn-in was followed by a 20 year evaluation period to measure rates of genetic gain in F9 lines. Genotype-by-year, genotype-by-location interaction variances were assumed to be equivalent to main genetic variance. 25 replications done. We simulated 5 complex and 3 simple traits to be behind the genetic merit. TP=varied, N.TP=3K, N.Markers=5K.
The use of recent or random individuals is preferable over old data. The use of recent data showed to be better than a random sample but not for much.
Treatment | Description |
---|---|
TPf5f6_PPf1_SPf1_OCS_N500 | TrainingPop=F5-F6 using 500 random individuals 5K markers, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
TPf5f6_PPf1_SPf1_OCS_N1000 | TrainingPop=F5-F6 using 1000 random individuals 5K markers, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
TPf5f6_PPf1_SPf1_OCS_N3000 | TrainingPop=F5-F6 using 3000 random individuals 5K markers, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
TPf5f6_PPf1_SPf1_OCS_N5000 | TrainingPop=F5-F6 using 5000 random individuals 5K markers, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
A 20 year burn-in period was used. Burn-in was followed by a 20 year evaluation period to measure rates of genetic gain in F9 lines. Genotype-by-year, genotype-by-location interaction variances were assumed to be equivalent to main genetic variance. 25 replications done. We simulated 5 complex and 3 simple traits to be behind the genetic merit.TP=random, N.TP=varied, N.Markers=5K.
The greater the number of individuals in the model the best. We limited our scenario to what was computanionally feasible. But the learning is that the more data is available the best.
Treatment | Description |
---|---|
TPf5f6_PPf1_SPf1_OCS_M500 | TrainingPop=F5-F6 using 3K random individuals 500 markers, PredictedPop=F1, RecyclingPop=F1 using a base index + optimal contribution. |
TPf5f6_PPf1_SPf1_OCS_M1000 | Same but 1000 markers |
TPf5f6_PPf1_SPf1_OCS_M2500 | Same but 2500 markers |
TPf5f6_PPf1_SPf1_OCS_M5000 | Same but 5000 markers |
TPf5f6_PPf1_SPf1_OCS_M10000 | Same but 10000 markers |
TPf5f6_PPf1_SPf1_OCS_M30000 | Same but 30000 markers |
A 20 year burn-in period was used. Burn-in was followed by a 20 year evaluation period to measure rates of genetic gain in F9 lines. Genotype-by-year, genotype-by-location interaction variances were assumed to be equivalent to main genetic variance. 25 replications done. We simulated 5 complex and 3 simple traits to be behind the genetic merit. TP=random, N.TP=3K, N.Markers=varied.
The number of markers seems to be important when is too low but when moving to the thousands of markers the differences among treatments are less. We expect this can change with different TP sizes. A grid is needed to find a definite answer but seems that 5-10 K markers may be enough.
We highly recommend the use of GP for reducing cycle time if all proper steps have been adopted. Watch out for: 1) Recycling early using GP models increases drastically the genetic gain but exahusts diversity much quicker. 2) Do not jump into GP to reduce cycle time if you haven’t sorted out the logistics. 3) Training models should have data for all traits, not only yield.
Practical recommendations for the adoption of GP for reducing cycle time include:
1) Implement a closed system to keep training models accurate.Do not try to predict exotic germplasm or use exotic germplasm to predict the elite population.
2) Exhausting the variance is inevitable and a good selection process needs to be used together with measures like using the right number of parents and OCS.
3) The time separation between TP and PP doesn’t seem to affect much once the method is stablished.
4) The number of individuals in the training model (assuming a closed system) has great impact and we recommend this use as much information as as possible (informations from the same pool, not exotic or research pools).
5) The number of markers used to capture the relationship between the TP and the PP is important but several thousand markers (i.e. 5K) is enough to capture the relationships well for several decades in a closed system.