Pular para o conteúdo principal

Palestrantes

  • Wouter de Bie Foto de rosto

    Wouter de Bie

    Director of Engineering @ Datadog

Para Empresas

Treinar 2 ou mais pessoas?

Dê acesso à sua equipe à biblioteca completa do DataCamp, com relatórios centralizados, tarefas, projetos e muito mais.
Experimente o DataCamp para EmpresasPara uma solução sob medida , agende uma demonstração.

Impactful Data Engineering—with Datadog's Wouter de Bie

May 2022

Slides

Summary

Data engineering is vital in modern organizations for managing and using large volumes of data. This field has developed quickly, especially with the arrival of the internet and big data technologies. It involves processes like extracting, transforming, and loading data (ETL), which are necessary for different applications such as analytics, recommendations, and reporting. The webinar underlined the value of democratizing data, allowing more users to access and use data for various purposes. Wouter Debye, with his broad experience at companies like Spotify and Datadog, stressed the need for a strong data infrastructure and discussed common difficulties and best practices in hiring and team building within the data engineering domain. The need for agility in technology choices, the importance of self-serve data platforms, and the integration of data management and governance were also main points discussed.

Key Takeaways:

  • Data engineering is necessary for extracting, transforming, and loading data to support various business applications.
  • Democratizing data access within organizations leads to innovation and new use cases.
  • Building a strong data infrastructure is important for scaling data engineering efforts.
  • Hiring should focus on software engineers with an interest for data rather than specific technology experience.
  • Effective data management and governance are vital for maintaining data quality and compliance.

Deep Dives

The Evolution of Data Engineering

Data engineering has developed significantly since the early 2000s, driven by the exponential growth of data due to the internet ...
Ler Mais

and advancements in technology. Initially, data was scattered across various databases, making it hard to combine and analyze. The development of big data technologies like MapReduce by Google revolutionized data engineering by enabling distributed storage and processing. This development allowed companies to centralize data storage and use it for a wide range of applications, from analytics to user behavior tracking. As Wouter Debye noted, the ability to scale data infrastructure has been key in this transformation, allowing companies like Spotify to use data for multiple purposes, including recommendations and artist insights.

Democratizing Data Access

One of the key themes discussed was the democratization of data access within organizations. This concept involves making data easily accessible to various stakeholders, including analysts and data scientists, through self-serve platforms. As Debye emphasized, "Making data discoverable and providing tools for self-service ingestion and transformation can unlock significant value." By enabling wider access to data, companies can encourage innovation and allow different departments to use data for unique insights and applications. This approach also reduces bottlenecks in data processing and empowers non-technical users to engage with data more effectively.

Building Strong Data Infrastructure

A strong data infrastructure is the backbone of effective data engineering. It involves creating systems that support the easy extraction, transformation, and loading of data across various applications. Wouter Debye shared insights from his experience at Datadog, underlining the importance of having a centralized data platform that can handle different use cases. This infrastructure must be scalable and flexible to accommodate the growing demands for data across the organization. By investing in proper infrastructure, companies can avoid reinventing the wheel for each new use case and allow data engineers to focus on delivering value through data-driven solutions.

Effective Hiring and Team Building

Hiring in data engineering requires a focus on identifying versatile software engineers who have a passion for working with data. Debye advised against focusing on specific technologies, as the field of data engineering is constantly evolving. Instead, hiring should prioritize candidates who are adaptable and can quickly learn new tools and frameworks. Building teams with a mix of technical and soft skills is also important, as data engineers often need to collaborate with various departments to understand and meet their data needs. As Debye mentioned, "Being able to translate technical requirements into business requirements and vice versa is a good skill to have."


Relacionado

white paper

Insights from Data Leaders

Distilled insights on data transformation from data science thought leaders

webinar

Storytelling for more impactful data science

Storytelling enables data teams to formulate impactful aspects of their work.

webinar

Driving Impact with Data Storytelling

Eight best practices you can adopt right now to become a better data storyteller

webinar

Data Storytelling: The Secret to Delivering Business Impact

Data Storytelling: The Secret to Delivering Business Impact with Analytics

webinar

Data Skills to Future-Proof Your Organization

Discover how to develop data skills at scale across your organization.

webinar

Democratizing Data Science at Your Company

Data science isn't just for data scientists. It's for everyone at your company.