About This Project

Introduction

WhereAreTheHikers.com, launched in 2018, hopes to answer two questions that frequently appear in Appalachian Trail-related discussions:

  1. Where is the bubble right now?
  2. When will the bubble get to {{insert shelter/town/landmark}}?

Those intimately familiar with the AT have come to learn its usage trends by observation and experience, but hopefully this site’s visualizations will serve as a confirmation of this "folk knowledge".


How It Works

The visualizations on this site are generated from the locations and dates contained in the Trail Journals entries of Appalachian Trail hikers from 2001-2016. The data is not limited to thru-hikers only; it visualizes all warm bodies on the trail for each day of the year, whether they're thru, section, or day hikers. The final data set includes data from:

  • 6,000 hikers
  • 234,000 individual entries
  • 910 locations
    • The top three locations visited were:
      • Damascus, VA (4,508 entries)
      • Hot Springs, NC (3,346 entries)
      • Springer Mountain (2,604 entries)
  • 365 days of the year
    • April 15 had the most entries (1,611)
    • December 24-26 had the fewest (9 each)
AT Timelapse

More Details for Data Buffs

I ran a script on all Appalachian Trail hiker journals from 2001-2016 to gather each entry's starting location and journal entry date ("Start Location" was used rather than "Destination" since there was no guarantee that the hiker would actually reach their desired destination that day).

There were 325,000 entries in total, with 234,000 having usable data in the Start Location field. The rest either were either blank, contained non-AT locations (such as "home"), vague locations (such as "stealth camp" or "in the woods"), or locations that were otherwise unidentifiable.

All 234,000 entries were manually examined and standardized to one of 910 known locations along the trail. While some automation was used to identify large batches of locations, manual examination was necessary to identify entries with misspellings, alternate spellings, or outdated names.

Also, as thru-hikers will doubtless know, multiple locations along the trail share the same name (e.g., there are two shelters named Cove Mountain, two towns named Waynesboro, and five(!) Deep Gaps). Differentiating these was done by using the "Destination" field to determine which geographic area the hiker was in at the time.

The data is stored in a MySQL database, and the visualizatons are generated using the Chart.js library and Vue.js framework. Below is an example of how the data is structured in the database. "Weight" represents the number of hikers found a set of coordinates on that particular day of the year.


About Me

WhereAreTheHikers.com was created by Rachel "Raincrow" Maderik, a part-time section hiker and full-time systems librarian and developer who lives in the Blue Ridge Mountains of Virginia.

Obligatory McAfee Knob photo