Bill Howard's Y-DNA Papers

The late Bill Howard developed a method for generating phylogenetic trees from Y-DNA STR results. While he was living, he made all of his papers freely available via PDF files accessible via Dropbox links that no longer exist. This web page reproduces Bill's main Dropbox PDF, the one that had links to all the other PDFs, replacing his Dropbox links with links to the PDF files on this website.

My own interest in Bill's work came from my participation in Cliff Johnston's "Johnstons of Annandale" Y-DNA project for a very specific group of us Johnstons. Cliff had found Bill and relied strongly on Bill's phylogenetic trees for interpretation of our Johnston branching.

Bill was a pioneer, breaking new ground and exploring the potential and the limits of his method. The subtitle of his 2014 (he passed away 25 July 2016) "Using Y-DNA Haplotypes to Estimate Their Dates of Origin: Pitfalls and Prospects" paper reflects this well. He was very willing to share his work, and this page allows that sharing to continue by fulling presenting all of the papers he shared with the world

I have replaced Bill's explicit Dropbox links with links you can access by clicking on the title. In cases where Bill did not include a date in the paper's main description, I have included the date, if he gave one in the paper or in the status text. Keep in mind that since Bill has passed away, his offers and contact information are no longer valid. My own inserted additions are in blue.-- Wesley Johnston

Bill Howard Memorials
Roberta Estes' blog page "Lighting Candles - Bill Howard, RIP"
American Astronomical Society Obituary

Download Phylogenetic Trees Python Code and User Manual

Since Bill Howard was very transparent in his papers, in 2018 Wesley Johnston and George El Zakhem in the United States and David Johnston in Australia began developing code to replicate Bill Howard s method. George El Zakhem developed the first implementation in MatLab. David Johnston in Australia then developed a robust Python implementation, which became the core program. George El Zakhem then worked with David Johnston's code to make the first runs on the combined kits of the Eastlick and Lake projects. Wesley Johnston then augmented the code and wrote the user manual. Wesley also developed the RCC1 version of the code, which mutes the impact of multi-step mutations.

The code now available thus represents the culmination of the work of Bill Howard, David Johnston, George El Zakhem and Wesley Johnston.

Index to Bill Howard s Genealogy and Genetic Papers, Videos, and Dated Y-STR Phylogenetic Trees

Use the links below to go to the papers I have been writing and some of the phylogenetic trees I have derived using correlation techniques to analyze the DNA results of pairs of 37 marker Y-Chromosome strings. The status of each paper is given.

PAPERS

  1. The Use of Correlation Techniques for the Analysis of Pairs of Y-STR Haplotypes, Part 1: Rationale, Methodology and Genealogy Time Scale.
    Howard, William E. III, Journal of Genetic Genealogy, 5(2)256-270, 2009

    This published paper introduces the technique and derives a calibrated time scale.

  2. The Use of Correlation Techniques for the Analysis of Pairs of Y-STR Haplotypes, Part 2: Application to Surname and Other Haplotype Clusters.
    Howard, William E. III, Journal of Genetic Genealogy, 5(2)271-288, 2009

    This published paper shows how a correlation analysis using the technique can show the evolution of surnames from subclusters to clusters to interclusters that takes place over several thousand years and how the time scale might serve as a future vehicle to link more closely together the time scales of genealogy, migration patterns, linguistic patterns, geology, archeology, anthropology and paleontology. The derived time scale may be useful to investigate the origin of haplotypes and haplogroups from the most recent ice age back to the date of our origins in Africa.

  3. The Evolution of the Gordon Surname: New Insight From Y-DNA Correlations and Genealogical Pedigrees.
    Gordon, Tei A., and Howard, William E. III, Journal of Genetic Genealogy 7, Fall 2011

    This published paper with Tei Gordon shows detailed relationships between Gordon pedigrees and Y-DNA results of the Gordon surname. It was submitted before Fred Schwab and I (next paper) used the Mathematica application to derive dated phylogenetic trees from the same data. Compare the two papers.

  4. Dating Y-DNA Haplotypes on a Phylogenetic Tree: Tying the Genealogy of Pedigrees and Surname Clusters into Genetic Time Scales
    Howard, William E., III and Schwab, Frederic R., Journal of Genetic Genealogy 7, Fall 2011

    Fred Schwab and I used the Y-DNA Gordon haplotypes (see above paper) in conjunction with the Mathematica application to produce a dated phylogenetic tree, and it is applied to the Gordon surname. Compare the two papers.

  5. A Dated Phylogenetic Tree of M222 SNP Haplotypes: Exploring the DNA of Irish and Scottish Surnames and Possible Ties to Niall and the U N ill Kindred
    Howard, William E. III and McLaughlin, John D., Familia, Ulster Genealogical Review No.27 (2011), pp. 14-50. Ulster Genealogical & Historical Guild, Belfast, Northern Ireland.

    John McLaughlin and I have written a paper that appeared in the journal, Familia, published by the Irish Historical Foundation in its 2011 edition that investigates the M222 SNP. The paper used to be available as a free PDF eBook but the Foundation has apparently discontinued its availability.

  6. Tracing Kee and McKee Genealogy: Using Pedigrees and DNA
    Howard, William E. III, Kee, Robert J., and McFarland, David, Familia, Ulster Genealogical Review No.28 (2012), pp. 99-133. Ulster Genealogical & Historical Guild, Belfast, Northern Ireland.

    Robert Kee, David McFarland and I have written a paper that has been published in the 2012 edition of Familia. Our paper investigates the Kee and McKee families of Co. Tyrone in Northern Ireland. It shows how we pieced together genealogy and family records to form a family tree, showed that it is consistent with the results of Y-DNA tests, and indicates that our three authors share a close common ancestor who probably lived in Scotland about 1670 years ago. The paper contains a pedigree and a dated Y-DNA phylogenetic tree that shows relationships among Kee and McKee testees.

  7. The Time of Origin and the Rate of Formation of Haplogroup I and its Subclades I1 and I2
    Howard, William E. III, submitted to the Journal of Genetic Genealogy 2012

    This paper was submitted to the JoGG in July 2012 but that journal is now defunct. It explores Haplogroup I and its I1 and I2 components and explores the evolution of the components from its progenitor to the present day.

  8. Generating a Dated STR-based Phylogenetic Tree From Y-DNA Haplotypes
    Schwab, Frederic R. and Howard, William E. III, submitted to the Journal of Genetic Genealogy 2012

    Fred Schwab and I present the computer code for generating a dated Y-DNA phylogenetic tree. The code provides surname project administrators and testees with an easily applied tool that produces a comprehensible graphic overview of all the genealogical and genetic relationships and their associated timescales among haplotypes. This paper was submitted to the JoGG in July 2012 but that journal is now defunct. An improved version of the code was produced by Schwab that has more flexibility and a greater span of application.

  9. A Comparative Analysis of the RCC Correlation and FamilyTreeDNA TiPTM Probability Approaches for Estimating the Time to the Most Recent Common Ancestor of a Pair of YDNA Haplotypes
    Howard, William E. III - 22 April 2013, revised 26 January 2014

    This paper compares FTDNA s TiPTM predictions to those of the RCC correlation approach to determining the time to the most recent common ancestor of a pair of Y-DNA haplotypes. It shows that the RCC approach using the sum of the absolute differences of marker values (m) is superior to the use of genetic distance (n) and shows that TiPTM predictions are useful for only about 2/3 of the cases where n = m.

  10. Uniting the Time Scales of Genealogy and Genetics Using Correlation Techniques to Explore Y-DNA
    Howard, William E. III- 25 February 2014

    This paper explores the errors and uncertainties in the RCC correlation approach, shows that the random mutations among the DYS sites produce an observed linear RCC time scale, but that a derived correction may be needed when genetic times are investigated. This paper is still in work and will be revised at this site from time to time.

  11. Using Y-DNA Haplotypes to Estimate Their Dates of Origin - Pitfalls and Prospects -
    Howard, William E. III - 3 August 2014

    Attempts to date Big-Y SNPs and groups of haplotypes need to carefully prepare samples prior to analysis, paying particular attention to: (1) Type A and Type B outliers and RecLOHs, (2) the relationships between genetic distance and predictions using Poisson statistics, (3) the importance of the sequence of junction points on the dated RCC phylogenetic tree, (4) the statistics of departures from an exponential function that expresses the number of junction points as a function of time, and (5) the purity and biases in the sample. The paper suggests a way to determine relative ages of SNPs and subclades and points out areas of exploration that require more research, particularly when dating methods by other analysts are not in agreement. No matter how sophisticated a method looks, it might give wrong results if the approach to dating gives an incorrect emphasis to important features of the sample being dated.

  12. DNA Science: Combining Genetic and Traditional Genealogy
    William E. Howard III and E. J. Hurley - January 2015

    This article provides a basic tutorial on the three main types of DNA testing that are used to aid genealogical inquiries. It is especially tailored to individuals who are curious about their ancestral lines and are considering DNA testing to support their traditional, document-based research. This paper describes the three current major genetic genealogy testing approaches, their differences, and their usefulness in breaking through some of the brick walls met in conventional genealogy.

  13. How to Interpret Entries on an RCC Tree
    Howard, William E. III - 3 April 2015

    This paper addresses questions about how testee results derived from STR haplotype strings presented on the RCC tree should be interpreted and how the evolutionary dates of the lines of descent from a progenitor can be estimated. The paper shows the relationships between Poisson predictions and positions on the tree and shows how rare it is that mutations can change the positions on the tree.

*****************************************************************************

FREQUENTLY ASKED QUESTIONS (FAQs)

Frequently Asked Questions FAQ - Revised 28 January 2013
This site contains a group of Frequently Asked Questions designed to show why the correlation approach should be used in conjunction with the more traditional approaches to Y-DNA analysis. It points out several advantages that the correlation approach has over the traditional approach because of the valuable insight it affords into the evolution of haplotypes. It also addresses points related to my phylogenetic trees.

*************************************************************************************

VIDEOs

  • V1
    The principles of the RCC correlation approach are illustrated in my 15 min. video that shows a surname time slice matrix and a dated Y-DNA tree and ties the evolution of a surname into events in history.
    See: http://www.youtube.com/watch?v=wmylJXlHZZs

  • V2
    Arline and Sidney Sachs produced a 27-minute video in which I trace some of my family roots in an interview with Arline and Chuck Mason.
    See http://tracingroots.nova.org, and search on Howard . This site contains many videos about genealogy and genetics.
    Direct link to the video: https://www.veoh.com/watch/v67017839dH54tGey

  • V3
    After Bill passed away in 2016, CeCe Moore posted the 44:15 video of Bill's 19 Aug 2014 at the i4GG (Institute for Genetic Genealogy) Conference
    Using Correlation Techniques on Y-Chromosome Haplotypes to Date Events - Exploring Different Avenues of Approach -
    See https://www.youtube.com/watch?v=zmuYUIEEp5I and see Bill's PowerPoint slides below.

POWER POINT PRESENTATION:
Using Correlation Techniques on Y-Chromosome Haplotypes to Date Events - Exploring Different Avenues of Approach -
My talk at the International Genetic Genealogy Conference (August 2014). Download this invited talk, attended by about 200 participants.
See the video (V3) of this presentation above.

*************************************************************************************

Y-DNA STR Phylogenetic Trees with Accompanying RCC Time Scale
10 RCC ~ 408 years for 37 markers
Standard Deviation for a Single Pair ~180 years

If you prepare a list of 37, 67 or 111 marker Y-DNA haplotypes that conform to the Template found below, I will try to derive a dated Y-STR phylogenetic tree for you. Please contact me in advance.
[Since Bill has passed away, this offer by him is no longer valid, but the template -- and his generosity and openness to sharing his work -- remain useful.]

TEMPLATE

=========================================================================

Q1a SNPs

  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/104Q1aSNPAverage.pdf - I had not downloaded this file, and the link no longer works. WJ
  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/178Q1aAverageQ1a.pdf - I had not downloaded this file, and the link no longer works. WJ

Hamilton Tree and Group A

  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/149HunterHamiltonA.pdf - I had not downloaded this file, and the link no longer works. WJ
  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/151HunterHamiltonA.pdf - I had not downloaded this file, and the link no longer works. WJ
  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/HamiltonTree.pdf - I had not downloaded this file, and the link no longer works. WJ
  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/MaymoreHamiltons.pdf - I had not downloaded this file, and the link no longer works. WJ

115 Modal Haplotypes

Howard

  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/Howard%20July13.pdf - I had not downloaded this file, and the link no longer works. WJ

McDonald

  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/McDonaldR1a.pdf - I had not downloaded this file, and the link no longer works. WJ
  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/McDonaldR1b.pdf - I had not downloaded this file, and the link no longer works. WJ

McKee

  • https://dl.dropboxusercontent.com/u/59120192/Genealogy/Trees/McKeeNoName120509Avgnewcode.v0.pdf - I had not downloaded this file, and the link no longer works. WJ

Last Update: 29 January 2015 [Date of Bill's last update]

Dr. William E. Howard III
1653 Quail Hollow Court, McLean, VA 22101-3234
wehoward at post.harvard.edu
Skype: wehowardiii


Send E-mail to wwjohnston01@yahoo.com

Last updated 18 Nov 2023: Add definition of RCC