DAVID BALDING
Professor of Statistical Genetics,

Imperial College London

POSTAL ADDRESS:

Department of Epidemiology & Public Health
Imperial College
, St Mary's Campus
Norfolk Place
, London W2 1PG

 

Local map

Phone:

Desk +44 (0)20 7594 3309 internal 43309

Mobile +44 (0)75 9025 0432 internal 50432
e-mail:
D.Balding@ic.ac.uk

PA Madeline Kirk: M.Kirk@ic.ac.uk

phone +44 (0)20 7594 3319 internal 43319

LINKS:

Centre for Biostatistics

My official IC website

 

           


SOME FUTURE EVENTS I AM ORGANISING

  • A session on Forensic Statistics at the European Meeting of Statisticians, Toulouse, July 20 – 24, 2009. The invited speakers will be Steffen Lauritzen (Oxford), Silvia Bozza (Venezia) and Roberto Puch-Solis (Birmingham).
  • A one-week workshop with a four-week programme on Statistical Challenges Arising from Genome Resequencing at the Isaac Newton Institute, Cambridge, July 12 – Aug 6, 2010.  Co-organisers are Chris Holmes and Gil McVean (Oxford) and Matthew Stephens (Chicago).

SOME COMMITTEES I AM A MEMBER OF

I’ve also recently been part of the German Federal Ministry of Education and Research “Long-Term Studies in Health Research" panel (2007-08).

 


PEOPLE WORKING WITH ME:

  • Jean-Marie Cornuet, from the Centre de Biologie et de Gestion des Populations at INRA, Montpellier, France, arrived Sept 2006 to commence a 2-year Marie Curie Intra-European Fellowship with us, on a project entitled "Statistical inference in population genetics".

 

PhD Students (see below for past PhD students):

Post-docs:

Charlotte Vignal (part-time, GSK, started Dec 2004)

Federico Calboli (was BBSRC now GSK, started Jan 2005)

Shu-Yi Su (started Oct 2005)

Delilah Zabaneh (BHF, started April 05)

Jon White (part-time, NIAB, started April 07)

Adaikalavan Ramasamy (EU, started Jan 08) Joint with NHLI

 

Matt Nunes (EPSRC, started Aug 2008)

 

FORMER POSTDOCS

 

  • William Astle (Oct 07 – Mar 09) is now working on another project in the Department.
  • Toby Andrew (Sept 05 – Mar 08) is now a Lecturer in Genetic Epidemiology at King’s College London
  • Marc Chadeau (Dec 06 – Feb 08) is now working on another project in the Department.
  • Clive Hoggart (Sept 04 – Feb 08) is now working on another project in the Department.
  • David Welch, (May 2006 – Dec 2007 is now at Penn State U.
  • In Oct 06 Taane Clark (postdoc funded by MRC from Feb 2005) moved to the Sanger Institute, Hinxton.
  • Photo of people working with me in 2000 at Reading (Grainne, Andrew, Patrick, Chiara, Karen, in Three Tuns pub) here.

BOOKS:


 

RECENT PUBLICATIONS (since start 2007) AND PAPERS IN PRESS (earlier publications)


Balding DJ (2009). DNA evidence: Interpretation of DNA profiles, in Encyclopedia of Forensic Science, Jamieson, A., Moenssens, A. (eds).  Wiley Chichester, UK, pp 2365-2376.


Balding DJ, Buckleton J, Interpreting low template DNA profiles, to appear Forensic Science International: Genetics, 2009, doi: 10.1016/j.fsigen.2009.03.003


Meyre D, Delplanque J, Chevre J-C, Lecoeur C, Lobbens S + 31 authors + Balding D, Walley A, Dina C, Froguel P; Genome-wide association study for early-onset and morbid adult obesity identifies three new loci in European populations, Nature Genetics 41, 157–159, 2009. doi:10.1038/ng.301


Vignal C, Bansal AT, Balding DJ, Binks MH, Dickson MC, Montgomery DS, Wilson AG; Genetic Association of the Major Histocompatibility Complex With Rheumatoid Arthritis Implicates Two Non-DRB1 Loci, Arthritis & Rheumatism 60(1): 5362, 2009, DOI 10.1002/art.24138.


Su S-Y, White J, Balding DJ, Coin LJM; Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions, BMC Bioinformatics, 2008, 9:513.


Cornuet JM; Santos F; Beaumont MA; Robert CP; Marin JM; Balding DJ; Guillemaud T; Estoup A, Inferring population history with DIY ABC: a user-friendly approach to Approximate Bayesian Computation. Bioinformatics 24(23): 2713–2719, 2008; doi: 10.1093/bioinformatics/btn514


Chadeau-Hyam M, Hoggart CJ, O'Reilly PF, Whittaker JC, De Iorio M, Balding DJ, Simulation of realistic sequence-level data in populations and ascertained samples. BMC Bioinformatics 2008, 9:364.


Hoggart CJ, Whittaker JC, De Iorio M, Balding DJ, Simultaneous Analysis of all SNPs in Genome–Wide and Re–sequencing Association Studies, PLoS Genetics 4(7): e1000130, 2008. doi:10.1371/journal.pgen.1000130


O’Reilly PF, Birney E, Balding DJ, Confounding between recombination and selection, and the Ped/Pop method for detecting selection, Genome Research 18: 1304-1313, August 2008, doi:10.1101/gr.067181.107


Chambers JC, Elliott P, Zabaneh D, Zhang W, Li Y, Froguel P, Balding D, Scott J, Kooner JS, Common genetic variation near the melanocortin-4 receptor gene is associated with waist circumference and insulin resistance, Nature Genetics 40, 716 – 718, June  2008. doi:10.1038/ng.156


Calboli FC , Sampson J,  Fretwell N, Balding DJ, Population structure and inbreeding from pedigree analysis of purebred dogs, Genetics, 179(1): 593–601, May 2008.  doi:10.1534/genetics.107.084954


Su SY, Balding DJ, Coin LJM, Disease Association Tests by Inferring Ancestral Haplotypes Using a Hidden Markov Model, Bioinformatics, 24: 972-978, April 2008. doi:10.1093/bioinformatics/btn071


Hoggart CJ, Clark TG, De Iorio M, Whittaker JC, Balding DJ, Genome-Wide Significance for Dense SNP and Resequencing Data, Genetic Epidemiology, 32(2): 179-185, February 2008.


Ioannidis JPA, Boffetta P, Little J, O’Brien TR, Uitterlinden AG, Vineis P, Balding DJ, Chokkalingam A, Dolan SM, Flanders WD, Higgins JPT, McCarthy MI, McDermott DH, Page GP, Rebbeck TR, Seminara D, Khoury MJ. Assessment of Cumulative Evidence on Genetic Associations: Interim Guidelines, Int. J. Epidem. 37(1):120-132, February 2008  doi:10.1093/ije/dym159


Leschziner GD, Jorgensen AL, Andrew T, Williamson PR + 3 authors + Balding DJ + 3 authors + Johnson MR, Pirmohamed M, The association between polymorphisms in RLIP76 and drug response in epilepsy, Pharmacogenomics, 8(12): 1715-1722, 2007.


Hoggart CJ, Chadeau-Hyams M, Clark TG, Lampariello R, De Iorio M, Whittaker JC, Balding DJ, Sequence-level population simulations over large genomic regions, Genetics 177: 1725–1731, 2007 doi:10.1534/genetics.106.069088


Clark TG, Andrew T, Cooper GM, Marguiles EH, Mullikin JC, Balding DJ. Functional constraint and small insertions and deletions in the ENCODE regions of the human genome Genome Biology, 8:R180, 2007  doi:10.1186/gb-2007-8-9-r180


Graffelman J; Balding DJ; Gonzalez A; Bertranpetit J.  Variation in Estimated Recombination Rates across Human Populations, Human Genetics, 122(3-4):301-310, 2007. doi:10.1007/s00439-007-0391-6


Leschziner GD; Andrew T; Leach JP; Chadwick D; Coffey AJ; Balding DJ; Bentley DR; Pirmohamed M; Johnson MR.  Common ABCB1 polymorphisms are not associated with multidrug resistance in epilepsy using a gene-wide tagging approach, Pharmacogenetics and Genomics 17: 217-220, 2007.


A genome-wide association study identifies novel risk loci for type 2 diabetes, Sladek R, Rocheleau G, Rung J, Dina C + 14 authors + Balding DJ, Meyre D, Polychronakos C, Froguel P, Nature 445: 881-885 2007. doi:10.1038/nature05616


Family-based association analysis with ordered categorical phenotypes, covariates and interactions, Baksh MF, Balding DJ, Vyse TJ & Whittaker JC, Genetic Epidemiology 31(1): 1-8, 2007. doi:10.1002/gepi.20183


SOFTWARE

  • HYPERLASSO software for simultaneous analysis of all SNPs and covariates in a GWAS can be found at the Bargen project webpage at the EBI. See Hoggart et al. 2008 (above).
  • FREGENE C++ software for simulating sequence-like data in large genomic regions in entire populations (say, 1 Mb in 100K individuals, or 20 Mb in 5K individuals), under a flexible range of scenarios for recombination (including gene conversion), demography (population growth and structure) and selection (directional and balancing) is available from the Bargen project webpage at the EBI.  See Hoggart et al, Genetics, 2007 and Chadeau-Hyam, BMC Bioinformatics (full references above).
  • HAPCLUSTER software for confirming and fine-mapping genetic associations in candidate regions, using haplotype clustering, is available from Thomas Mailund's homepage.  The original R code described in Waldron, Whittaker and Balding (2006) only analysed phased haplotype data, and implemented a general risk model (separate risk for each genotype at the postulated causal locus).  Thomas has recoded it in C++ and extended it to handle unphased genotype data, to output the Bayes Factor assessing the evidence against the null hypothesis of no association, and to implement an allelic disease model.  The latter should give more power for near-multiplicative disease risks, but may be affected by deviations from Hardy-Weinberg equilibrium.  The latest version (2.1.5) also allows unphenotyped individuals, to facilitate imputation e.g. from HapMap individuals.
  • BAYESFST software for Bayesian hierarchical inference for Fst, described in Theoretical Population Biology, 63(3): 221-230, 2003 and Molecular Ecology, 13: 969-980, 2004, is available here.
  • MAC5 software for phylogenetic inference, described in McGuire, Denham & Balding (2001a,b) may be downloaded from here.
  • C programs described in Ayres & Balding, Heredity, 1998, and Ayres & Balding, Genetics, 2001, are available here.
  • BATWING software, a development of that described in Wilson & Balding, Genetics,1998, may be downloaded from Ian Wilson's home page.

PROFESSIONAL SOCIETIES:

JOURNALS:

Fellow of the Royal Statistical Society (member of Council 1999-2004).

AE of International Statistical Review (1999 – 2005)

Member of the International Statistical Institute.

AE of Biostatistics (2000 – 2004)

Member of the International Biometric Society (British & Irish President 2006-08)

AE of Human Genomics (2003 – 2006)

Member of the International Genetic Epidemiology Society

AE of Annals of Human Genetics (2004 – )


BACKGROUND:

 

PAST PhDSTUDENTS:

  • Dr David Wright (EPSRC, 1990 – 1995).  Now Senior Statistical Assessor and Scientific Advice Coordinator at Medicines and Healthcare products Regulatory Agency, London
  • Dr Karen Ayres (EPSRC, 1995 – 1998).   Now Lecturer in Applied Statistics, University of Reading
  • Dr Martyn Byng (BBSRC, 1997 – 2001).  Now Senior Technical Consultant at NAG, Oxford.
  • Jacky Civil (BBSRC/GSK, 2000 – 2004; did not submit).  Now with Data Sciences Group at Unilever, Beds UK.
  • Dr Lianne Mayor (MRC, 2002 – 2006).  Now at Deloitte, Reading.
  • Dr Ed Waldron (BBSRC/GSK, 2003-2007).  Now Statistician at Takeda R&D, London.
  • Paul O’Reilly (BBSRC, 2004 2008). Now postdoc in our department.
  • William Astle (MRC, 2004 – 2008). Now postdoc working with me.

CONSULTINGand SHORT COURSES

I offer a statistical consulting service, primarily to lawyers. Most often I advise on statistical aspects of the interpretation of DNA profile evidence, but I have also provided advice on a variety of other statistical aspects of forensic science, epidemiology and other fields.

I teach a one-day short course on "Weight-of-evidence for forensic DNAprofiles", based on my book of the same name, which was last presented at UNSW, Sydney, on July 6 2005.

I also co-teach a two-day short course "Statistical Analysis of Genetic Association Studies", which was last presented at St Mary’s, London, on 28/29 Sept 2006.


PREVIOUS PUBLICATIONS

2006

·        A tutorial on statistical methods for population association studies, Balding DJ, Nature Reviews Genetics 7: 781-791, 2006.  doi:10.1038/nrg1916

·        Clinical factors and ABCB1 polymorphisms in prediction of antiepileptic drug response: a prospective cohort study, Leschziner G, Jorgensen AL, Andrew T, Pirmohamed M, Williamson PR, Marson AG, Coffey AJ, Middleditch C, Rogers J, Bentley DR, Chadwick DW, Balding DJ, Johnson MR, Lancet Neurology 5: 668–76, 2006.

·         Exon sequencing and high resolution haplotype analysis of ABC transporter genes implicated in drug resistance, Leschzinger G, Zabaneh D, Pirmohamed M, Owen A, Rogers J, Coffey AJ, Balding DJ, Bentley DB & Johnson MR, Pharmacogenetics and Genomics, 16: 439-450, 2006.

·         Discrimination of half-siblings when maternal genotypes are known, LR Mayor & Balding DJ, Forensic Science International, 159(2-3): 141-147, 2006. doi:10.1016/j.forsciint.2005.07.007

·         Fine mapping of disease genes via haplotype clustering, ERB Waldron, JC Whittaker & Balding DJ, Genetic Epidemiology, 30: 170-179, 2006 Software

·         Logistic regression protects against population structure in genetic association studies, E Setakis, H Stirnadel & Balding DJ, Genome Research, 16: 290-296, 2006.

·         A family-based association test with a complex phenotype, covariates and interactions: assessing CRP and lupus, Baksh MF, Balding DJ, Vyse TJ & Whittaker JC, Annals of Human Genetics 70: 131-139, 2006.

2003-2005

·         Paternity index calculations when some individuals share commonancestry, KL Ayres & Balding DJ, Forensic Science International, 151(1): 101-103, 2005.

·         The impact of low-cost, genome-wide resequencing on association studies,Balding DJ, editorial for Human Genomics, 2(2): 79-80, 2005.

·         A question of identity, Balding DJ, Significance, 2(1): 20-23, March 2005.

·         Identifying adaptive genetic divergence amongpopulations from genome scans, MA Beaumont & DJB, Molecular Ecology, 13:969-980,2004. Software

·         Clustering of protein domains in the human genome, LRMayor, KP Fleming, A Müller, DJB & MJ Sternberg, Journal of MolecularBiology, 340(5): 991-1004,2004.

·         Little loss of information due to unknown phase forfine-scale LD mapping with SNP genotype data,  AP Morris, JC Whittaker & DJB, American Journal ofHuman Genetics, 74(5):945-953,2004.

·         Multipoint LD mapping narrows location interval andidentifies mutation heterogeneity, A. Morris, J. Whittaker, C-F Xu, LHosking& DJB, Proceedings of theNationalAcademy of Sciences of the USA 100(23):13442–13446,2003.

·         Gametic phase estimation over large genomic regionsusing an adaptive window approach, L. Excoffier, G. Laval & DJB, Human Genomics 1(1): 7-19, 2003.

·         Handbookof Statistical Genetics, 2nd edition, DJB, M.Bishop, & C. Cannings (Eds), Chichester:Wiley, 2003.

·         Inferences from DNA data: population histories,evolutionary processes, and forensic match probabilities, I.J. Wilson, M.E.Weale & DJB, read before the Royal Statistical Society, November 2002, Journalof the Royal Statistical Society A 166(2): 155-187, 2003 (see also discussion pp 188-202).

·         Likelihood-based inference for genetic correlationcoefficients, DJB, Theoretical Population Biology, 63(3): 221-230, 2003.

·         Chromosome-wide distribution of haplotype blocks andthe role of recombination hotspots, M.S. Phillips, R. Lawrence, R.Sachidanandam, A.P. Morris, DJB, + 29 authors + L.R. Cardon, NatureGenetics, 33:382-387,2003.

 

2000-2002

  • Approximate Bayesian computation in population genetics, M.A. Beaumont, W. Zhang, & DJB, Genetics, 162: 2025-2035, 2002.
  • Implications for DNA identification arising from an analysis of Australian forensic databases, K.L. Ayres, J. Chaseling, & DJB, Forensic Science International, 129: 90-98, 2002.
  • Coalescent theory and modeling, DJB, pp 170-175 of Encyclopedia of Evolution, M. Pagel ed, Oxford UP, 2002.
  • Patterns of human diversity, within and among continents, inferred from biallelic DNA polymorphisms, C. Romualdi, DJB, I.S. Nasidze, G. Risch, M. Robichaux, S. Sherry, M. Stoneking, M.A. Batzer & G. Barbujani, Genome Research, 12: 602-612, 2002.
  • Fine scale mapping of disease loci via shattered coalescent modelling of genealogies, A.P. Morris, J.C. Whittaker & DJB, American Journal of Human Genetics, 70: 686-707, 2002.
  • The DNA database search controversy, DJB, Biometrics, 58: 241-244, 2002.
  • MAC5: Bayesian inference of phylogenetic trees from DNA sequences incorporating gaps. G. McGuire, M.C. Denham, & DJB, Bioinformatics , Vol 17, pp 479-480, 2001.
  • Models of evolution for DNA sequences including gaps, G. McGuire, M.C. Denham & DJB, Molecular Biology and Evolution, Vol 18, pp 481-490, 2001.
  • Measuring gametic disequilibrium from multi-locus data, K.L. Ayres & DJB, Genetics, Vol. 157, pp 413-423, 2001. C programs
  • Handbook of Statistical Genetics, DJB, M. Bishop, & C. Cannings (Eds), Chichester: Wiley, 2001.
  • Analysis of infectious disease data from household outbreaks by Markov chain Monte Carlo methods. P.D. O'Neill, DJB, N.G. Becker, M. Eerola, & D. Mollison, Applied Statistics, Vol 49, pp 517-542, 2000.
  • Bayesian fine scale mapping of disease loci using hidden Markov models, A.P. Morris, J.C. Whittaker & DJB, American Journal of Human Genetics, Vol 67, pp 155-169, 2000.
  • Interpreting DNA evidence: can probability theory help? Chap. 3, pp 51-70, of Statistical Science in the Courtroom, J.L. Gastwirth ed, Springer-Verlag, New York, 2000.

 

1997-1999

  • When can a DNA profile be regarded as unique? Science & Justice, Vol. 39, pp 257-260, 1999.
  • Forensic applications of microsatellite markers. pp 198-210 of Microsatellites: evolution and applications, D. Goldstein and C. Schlotterer eds, Oxford UP, 1999.
  • Genealogical inference from microsatellite data . I.J. Wilson and DJB, Genetics, 150, pp 499-510, 1998.
  • Measuring departures from Hardy-Weinberg: a Markov Chain Monte Carlo method for estimating the inbreeding coefficient. K.L. Ayres and DJB, Heredity, 80, pp 769-777, 1998.
  • Inferring coalescence times from DNA sequence data. Simon Tavaré, DJB, R.C. Griffiths and Peter Donnelly, Genetics 145 pp 505-518, 1997.
  • Significant genetic correlations among Caucasians at forensic DNA loci. DJB and R.A. Nichols Heredity 78, pp 583-589, 1997.
  • Errors and misunderstandings in the second NRC report. Jurimetrics, 37, pp 469-476, 1997.
  • The design of pooling experiments for screening a clone map. DJB and D.C. Torney, Fungal Genetics and Biology, 21, pp 302-307, 1997.

 

1994-1996

  • Optimal pooling designs with error detection. DJB & D.C. Torney, J. Comb. Th. A 74, pp 131-140, 1996.
  • Evaluating DNA profile evidence when the suspect is identified through a database search. DJB & P. Donnelly, J. Forensic Sci., 41, pp 603-607, 1996.
  • Population genetics of STR loci in Caucasians. DJB, M. Greenhalgh & R.A. Nichols, Int. J. Legal Med., 108, pp 300-305, 1996.
  • Estimating the age of the common ancestor of men from the ZFY intron. P. Donnelly, S. Tavaré, DJB & R. Griffiths, Science, 272, pp 1357-1359, 1996.
  • A comparative survey of non-adaptive pooling designs, pp 133-154 of "Genetic mapping and DNA sequencing", IMA Volumes in Mathematics and its Applications, T.P. Speed & M.S. Waterman eds, New York: Springer-Verlag, 1996, DJB, W.J. Bruno, E. Knill & D.C. Torney.
  • Inference in forensic identification. DJB & P. Donnelly, J. Roy. Statist. Soc. A 158, 1995, pp 21-53; Discussion paper, read to the Society April 1994.
  • Efficient pooling designs for library screening. W.J. Bruno, E. Knill, D.C. Bruce, DJB, N.A. Doggett, W.W. Sawhill, R.L. Stallings, C.C. Whittaker & D.C. Torney, Genomics 26, pp 21-30, 1995.
  • A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. DJB & R.A. Nichols, Genetica 96, pp 3-12, 1995.   ERRATUM: Table 2 of this article contains an error: in case Mother = AA, Alleged Father = AB, and Child = AB the 2 in the numerator should be deleted, so that the correct LR is 2(F+(1-F)pB)/(1+3F).  This correction is now published in the journal: GENETICA 133(1) May 2008.
  • Estimating products in forensic identification using DNA profiles. DJB, J. Amer. Statist. Assoc. 90, pp 839-844, 1995.
  • Inferring identity from DNA profile evidence. DJB & P. Donnelly, Proc. Natl. Acad. Sci. USA 92, pp 11741-11745, 1995.
  • DNA profile match probability calculation: how to allow for population stratification, relatedness, database selection and single bands. DJB & R.A. Nichols, Forensic Sci. Inter., 64, pp 125-140, 1994.
  • How convincing is DNA evidence?. DJB & P. Donnelly, Nature, 368, pp 285-286, 24 March 1994.
  • Design and analysis of chromosome physical mapping experiments. DJB, Phil. Trans. Roy. Soc. B 244, pp 329-335, 1994.
  • The prosecutor's fallacy and DNA evidence. DJB & P. Donnelly, Crim. Law Rev., October 1994, pp 711-721.

 

1985 - 1993

  • Computations for mapping genomes with clones. C.C. Whittaker, M.O. Mundt, V. Faber, DJB, R.L. Dougherty, R.L. Stallings, S.W. White & D.C. Torney. Int. J. Genome Res., 1, pp 195-226, 1993.
  • Detecting gene conversion: Primate visual pigment genes. DJB & R.A. Nichols, & D. Hunt. Proc. Roy. Soc. B 249, pp 275-280, 1992.
  • Effects of population structure on DNA fingerprint analysis in forensic science. R.A. Nichols & DJB, Heredity, 66 pp 297-302, 1991.
  • Statistical analysis of fingerprint data for ordered clone physical mapping of human chromosomes. DJB & D.C. Torney, Bull. Math. Biol. 53, pp 853-879, 1991.
  • Diffusion-controlled reactions in one dimension: Exact solutions and deterministic approximations. DJB & N.J.B. Green, Phys. Rev. A 40, pp 4585-4592, 1989.
  • Invasion processes and binary annihilation in one dimension. DJB, P. Clifford & N.J.B. Green. Phys. Lett. A 126, pp 481-483, 1988.
  • Diffusion-reaction in one dimension. DJB, J. Appl. Prob. 25, pp 733-743, 1988.
  • Risk factors and heart disease mortality. H. Alexander, DJB, A. Dobson, R. Gibberd, D. Lloyd & S. Leeder, Med. J. Aust. 144, pp 20-22, 1986.
  • A mathematical model of tumour-induced capillary growth. DJB & D.L.S. McElwain. J. Theor. Biol. 114, pp 53-73, 1985.

 
last updated 26 January  2009.