Skip to main content

How To Plan a Successful Datathon

Learn everything you need to know about datathons, from their benefits, how to organize one, and more.
Apr 2021  · 6 min read

What is a datathon? Similar to hackathons, a datathon is an event where participants gather to solve practical problems through the application of data science tools and techniques, by working together in teams to generate insights and potential solutions.

The benefits of a datathon

With the world transforming into a digital one, the idea of a datathon is fast becoming mainstream. The benefits of a datathon, whether organized by a company or by a volunteer group, are extensive.

Encourages teamwork: These events create a special sense of purpose to achieve something greater collectively, drawing out creativity and new perspectives that often emerge when working intensely in a time-bound setting.

Fosters collaboration: In a traditional hierarchical structure, decision-making is based on positional authority. In a horizontal setting, decision-making is based on competence and value contribution, attributes commonly found in high-performing teams. This structure also promotes transparency and candor, requiring team members to exhibit a higher degree of emotional intelligence. More time is devoted to issue identification and less on worrying about being “kept in the loop”. Teams tend to be more autonomous, depending on one another to carry through on commitments. A fluid environment is created where ideas are shared openly and the steps taken to test an idea happen quickly. Some ideas “fail quickly” to produce valuable learnings, while other ideas proceed to the next phase of viability.

Promotes innovation: A traditional corporate culture, particularly that found in the oil and gas sector, is a sensibility that promotes safety and reliability, critical to be successful in this industry. However, such a mindset inadvertently stifles innovation. It is a mindset that fears “what could go wrong” rather than fearing if something good does not occur. In a datathon, the mindset is an entrepreneurial one.

How is a datathon organized?

Firstly, you should make sure these key components are in place—a well-structured dataset that can drive insights on a business problem that needs solving, a positive environment for participants to learn new data skills, a chance for participants to make professional connections, and for everyone to have fun! Secondly, datathons can be run in-person (in adherence to local health authority guidelines during a pandemic, of course) or completely virtual. A participant has the best experience when the event is inclusive and welcoming to both beginners and seasoned data scientists alike. Anchor the event around informative and engaging meetings, which could include a kick-off event to let participants know what to expect, an opening ceremony to start the competition, and a closing ceremony to celebrate its conclusion. Events in between can drive up engagement from a learning and networking perspective, such as through learning workshops and social networking gatherings. Communications throughout the event are kept high through postings on the event’s social media channels, while near-real-time communications is achieved through using IRC-style platforms like Slack, Discord, or Teams.

Sponsorship not only provides much-needed financial support, but it also adds credibility to your event and sponsors can also offer prizes to motivate your participants.

It is human nature to seek out fun activities. Gamification is the application of game playing (e.g. point scoring, competition with others, rules of play) to encourage engagement and motivation. It can also lead to a different, even unexpected, way of solving a business problem. To be inclusive, it should not be the primary focus but is a delightful feature of a datathon.

How long is a datathon?

Datathons can be held over the course of a single day, over multiple days or a weekend, or even span across a month. It’s important to choose the right time period to sustain engagement throughout your event. In-person events are wonderful for building lasting network connections, though virtual events can attract a broader, even global, audience. Longer events also require more effort from the planning team, though the effort to plan a successful single-day datathon is similar to the effort needed to plan a multi-day one.

Want to see a datathon in action?

Come join the GTX 2021 – Geothermal datathon happening between April 20 and June 25, 2021. Hosted in Calgary, Canada, this virtual event will draw people from around the world. Organized by the Society of Petroleum Engineers (Calgary and US Gulf Coast chapters) and Untapped Energy, this datathon is driven by the desire to use real data to solve a real problem, elevate data literacy, and make new professional connections—all while having fun! The problem: aging oil and gas wells represent an option for the transition to a low-carbon energy world and a future for drilling engineers. Geothermal is a suitable approach and will be the focus of this datathon. All participants enjoy:

  • Use of a great energy industry dataset.
  • Access to a series of data science bootcamps and workshops.
  • Access to an incredible library of self-paced online data science courses through DataCamp.
  • Entry into the datathon competition to test and prove data skills.
  • Opportunities to meet and work with fellow data enthusiasts to solve real-world problems.
  • The chance to win a series of great prizes.

More details about this event:

DataCamp – Learning Platform and Learning Competition

All registered participants receive access to the entire DataCamp platform for the duration of the datathon, which includes all of the platform’s 350+ courses, 90 projects, and 10 skill assessments. Additionally, DataCamp is sponsoring a learning competition: the participant who completes the greatest number of courses throughout the datathon will receive a one-year Premium subscription to DataCamp.

I hope this article has helped create excitement about running your first datathon. Please feel free to connect with me on LinkedIn and keep me updated on your datathon. Good luck!


The 23 Top Python Interview Questions & Answers

Essential Python interview questions with examples for job seekers, final-year students, and data professionals.

Abid Ali Awan

22 min

Working with Dates and Times in Python Cheat Sheet

Working with dates and times is essential when manipulating data in Python. Learn the basics of working with datetime data in this cheat sheet.
DataCamp Team's photo

DataCamp Team

Plotly Express Cheat Sheet

Plotly is one of the most widely used data visualization packages in Python. Learn more about it in this cheat sheet.
DataCamp Team's photo

DataCamp Team

0 min

Getting started with Python cheat sheet

Python is the most popular programming language in data science. Use this cheat sheet to jumpstart your Python learning journey.
DataCamp Team's photo

DataCamp Team

8 min

Python pandas tutorial: The ultimate guide for beginners

Are you ready to begin your pandas journey? Here’s a step-by-step guide on how to get started. [Updated November 2022]
Vidhi Chugh's photo

Vidhi Chugh

15 min

Python Iterators and Generators Tutorial

Explore the difference between Python Iterators and Generators and learn which are the best to use in various situations.
Kurtis Pykes 's photo

Kurtis Pykes

10 min

See MoreSee More