Skip to main content
HomePodcastsData Literacy

Driving Data Democratization with Lilac Schoenbeck, Vice President of Strategic Initiatives at Rocket Software

Richie and Lilac explore data democratization, common data problems that data democratization can solve, confidence with data, good data culture, processes to encourage good data usage and much more.
Nov 2023

Photo of Lilac Schoenbeck
Guest
Lilac Schoenbeck

Lilac Schoenbeck is the Vice President of Strategic Initiatives at Rocket Software. Lilac has two decades of experience in enterprise software, data center technology and cloud, with wider experience in product marketing, pricing and packaging, corporate strategy, M&A integrations and product management. Lilac is passionate about delivering exceptional technology to IT teams that helps them drive value for their businesses.


Photo of Richie Cotton
Host
Richie Cotton

Richie helps individuals and organizations get better at using data and AI. He's been a data scientist since before it was called data science, and has written two books and created many DataCamp courses on the subject. He is a host of the DataFramed podcast, and runs DataCamp's webinar program.

Key Quotes

The companies that I've seen be successful with this are not actually trying to overhaul the world on day one. Because it's a fruitless effort. New systems are gonna come in every three months and six months and we'll be chasing our tails the whole time. Like let's build a practice of incorporating data into our view of the world and have that continue to expand incrementally. And if you do it that way, you also get the buy-in of the business, the people sort of clamoring for more. And when people clamor for more, you typically get the budget, you typically get the focus, right? And that creating that demand within your organization is gonna help move the ball forward.

When thinking about improving your organization's data usage, start small and try to set aside the fear. Create little bits of willing, engaged pockets of applications and people. Start with where the enthusiasm is and where the demand is, and incrementally grow from there. That's the best approach because if you get some early wins, if you get some demand, if you get some enthusiasm, from there that can really motivate your organization.

Key Takeaways

1

Democratize access to data: Improve access to data by providing the right tools, such as a data catalog and glossary, to ensure a shared understanding of truth within the organization.

2

Foster collaboration: Encourage collaboration between data scientists and business teams to support data queries and establish a shared understanding of data across the organization.

3

Create a data topology map: Build a comprehensive view of how data flows through the organization to identify dependencies and make informed decisions about data management and system changes.

Links From The Show

Transcript

Richie Cotton: Welcome to DataFramed. This is Richie. Having all your colleagues able to use data effectively is a goal for, well, frankly, every organization. But knowing that you need to do that is only one step of many, and it leads to a lot of tricky questions like... Who needs what data skills? And how should the data team interact with the rest of the organization?

And what data do we actually have anyway? So to do data democratization right, there are many things you need to think about, from high level strategy, Down to process changes and personnel interactions. To help you out today, I'm talking to Lilac Schoenbeck, the vice president of strategic initiatives at Rocket Software.

Lilac's gone from being a developer to managing business development teams and marketing teams and product teams. So she's got an incredibly rich view of many aspects of business. At Rocket, she's been heavily involved in their data intelligence platform that's designed to help bring data to the corporate masses.

I'm fascinated to hear her perspectives on how to improve data across whole organizations.

Hi Lilac, thank you for joining me on the show.

Lilac Schoenbeck: Hi, Richie. Nice to see you.

Richie Cotton: think to begin with, let's, make sure that everyone understands what we're talking about. So what does data democratization mean to you?

Lilac Schoenbeck: Data democratization is an interesting word. I love that it feels like it's bringing freedom to the... See more

data within an organization. I, I think the biggest piece for me is that it means that what we're providing is broader access to data within a company to a number of people who could use it to make different kinds of decisions.

And I think all of us have been inside organizations where decisions are on the basis of a lot of things, sometimes just good hunches, sometimes an understanding or a perceived understanding of truth, having the ability to access the company's data will enable far better decision making, and I think democratizing it means that it's not just the senior executives who are privy to the data.

data, but actually anybody within the organization that should have access to it can make a driven set of decisions.

Richie Cotton: Excellent. So it really is about bringing things to the whole company. and so I think in some organizations, there is this perception that data should be something that the data team does, and it's not necessarily a concern of anyone else. So what do you think the advantages are of having this widespread access and knowledge about data?

Lilac Schoenbeck: You know, It's an interesting question. I think, the reason people say that is that data feels trickier, complicated, and we all took stats class in college, and we remember that three standard deviations is something that we actually couldn't define at a cocktail party if we tried. then there's this concern, right, that actually data is often misunderstood or misread.

So when I say a certain field implies, I don't know, total revenue or customer account, you'll tell me, well, actually, it's customer account on alternate Thursdays, and you didn't know the asterisk associated with that. So maybe you'd misinterpret it. So there's this sort of persistent belief that it requires a specialized set of knowledge.

if not skills, in order to understand the data. I think the key is actually transparency there, right? The key is to build a framework for the data that allows let's call it lay people, normal humans to query that data and get a sense for the answers. I think the second piece, though, that, also harkens back to, I think, perhaps all of the trauma we had from college physics, number of significant digits is actually a really reasonable concept in this construct, right?

There's a difference between saying precisely, you know, 47. 35 percent of our customers live in Idaho, Versus saying, you know, the majority of our customers are in the Northwest, and I think that The notion of democratizing data and allowing people who aren't the data scientists to have access to broader swaths of data also comes with a certain understanding or access to the education of how to draw conclusions from information and what a degree of significance would be in the variability they're seeing in data, So there's an education process there but I think it's one that's congruent with. the way that people are approaching business more broadly and less hunches, less feelings, more, what, pieces of data can we bring to bear to validate our hypotheses? And at the highest level those broad trend lines and basic understandings being distributed across an organization leads to better decisioning.

Richie Cotton: Okay. So you're not just, writing some extensive thesis every time you make a decision with data. It's just about like, get some sensible numbers and use them to

Lilac Schoenbeck: right. That's right. I mean, most companies will tell you most enterprise software companies, and this is is sort of the domain I've been in for 25 years, will tell you most of our customers are in financial services, health care or government. We will say that every time it says that on every single prospectus for every software company ever.

How many of us actually know when we say most? Do we mean 50%? Do we mean 95%? And it really depends on the company. But then the downstream decision you're going to make, for example, to advertise in something as simple, I'm advertising in the Wall Street Journal, or am I going to advertise in grocery stores today, right?

So the distinctions that actually, a vague sense of that number will change your answer. And it doesn't need precision, and it doesn't need, lot of deep data understanding. It just needs a high level understanding of what's vaguely going on.

Richie Cotton: Yeah, that does seem pretty sensible, like have a vague idea of what's going on in your company and that you can use data for that. Excellent. I'd like to hear about some success stories. So do you have any examples of companies that have gone all in on data democratization and they've seen some sort of benefit?

Lilac Schoenbeck: Yeah, and I think, I think all it is interesting because I feel like it's very much an evolution. But some organizations that we've worked with, large financial services firms, shockingly enough have gone really far in the idea of providing basically a data catalog, a data library, and, also in addition, a set of very talented and collaborative data scientists on their teams that support the business in their data queries.

And I think the, benefits of it are that they are able to anchor the whole organization on a shared understanding of truth. That's the first big step, because what often happens, and I think in any organization any of us have been in, is that one department says something is true, and another department says another thing is true, and you could actually waste the entire meeting debating which one is true which isn't a good use of anybody's time and often, by the way, stems from a misunderstanding of definitions, or a misalignment of what XYZ might mean, or is it revenue in this period, or revenue in the last quarter, revenue in the last 12 months, or revenue in the last fiscal year, right?

These distinctions actually are important. would make different versions of the truth. And so what these organizations that have been successful have done has, have been first and foremost to create a truth of the data that can be agreed upon. And the benefit of that is that actually what we want from our senior leaders, from the people making decisions across the organization is not to debate truth.

But to leverage truth in order to make other decisions. And that's the level that they have made very successful. They can decide, am I going to switch a system, or change an approach to a database, or do these other things, because I understand the impact of that across my entire data fabric and footprint.

That's a data driven decision. I don't need to sit there and debate whether or not the hip bone is connected to the leg bone, or whether I've misunderstood the whole skeleton. It does seem that, the sort of inter team or inter departmental conflicts about the different perceptions of what the company's doing or how the world works, they just seem to happen far too frequently. I do like that idea that data is this sort of mediating force, where you...

Richie Cotton: It's going to help different companies agree and not have stupid arguments.

Lilac Schoenbeck: I mean, let's not pretend we're going to eliminate them completely. That feels really

Richie Cotton: Well, yeah,

Lilac Schoenbeck: to our humanity.

Richie Cotton: fighting with other teams is often fun anyway, so yeah. Maybe it needs to continue a little bit. alright, can you tell me about what some of the most common, data problems are that you've seen businesses face? And then having the idea of data democratization, is going to help this.

Lilac Schoenbeck: Yeah, I think one of the biggest things, and this is funny because I'm, I'm thinking back to I grew up in the Pacific Northwest and there was a, there's obviously a large airplane manufacturer out there. And I just remember as a young computer science student, I couldn't have fathomed when they came and spoke or I don't remember the context and said how many applications they had running.

And the number was something like 10, 000 applications running within their walls. Now, I haven't, that's. 25 year old data, so I'm going to say I have no idea if that's grown or shrunk. But it really impressed upon me the number of different systems within a large organization. And each one of those systems is creating and generating data.

Back in the day, we printed it out on those like really loud printers that went, you know, with a little piece of paper on the edges that we tore off and used to make small figure puppets. But the situation now is that we still have largely, in a large bank, in a large multinational corporation, a thousand or more applications running often in the multi thousands, each one of them creating their own data, and each one of them with a set of dependencies that normal human brains cannot possibly fathom, right?

It's just too complex a system. We have built Something of a house of cards within these organizations. And then the challenge is, is that we also want to continue to evolve these applications, modernize these applications. What we hear is, I'd like to move some percentage of my footprint to a SaaS based alternative and not an on premise homegrown thing, right?

That's a daily conversation for CIOs around the world. How do you even know what the impact would be of removing that one piece from the system? And that impact actually could be cataclysmic, Or what you could decide is I'm going to understand the top three impacts and let the next hundred just come as they may.

But the Jenga tower that we've built in this situation is actually quite terrifying. And so one key piece that organizations benefit from is an actual topology of how data and data dependency works within all of those organizations so that you can make an intelligent decision about the order even in which you make changes, And so that's a piece that we, that we found is really, really productive for an organization is to just have a map of you. How does the data flow through my organization? And we all know that your name is spelled right in one of the systems and wrong in the other, that your address got updated in one place and not the other.

And if you can extrapolate that to all the internal pieces of data that an organization might have, truly, it's a stew, it's a mess. And that's not a bad thing. I don't actually believe that this is an attic that needs to be tidied. I believe that this is, the chaos is the state of the system.

But having the tools to be able to navigate that chaos, to understand the impact analysis of making changes, to understand what has GDPR type situation or event, that's invaluable to a company. And it actually frees you up to make different kinds of business decisions because there isn't that latent and wholly legitimate terror that something is going to go horribly wrong,

Richie Cotton: It does sound like understanding what data you have and who's using it and how it all connects together seems like a really good sort of starting point for any kind of program. And yeah, you think that that ought to be the case everywhere, but now I'm thinking about it. I'm sure it's probably the case.

Absolutely. No way.

Lilac Schoenbeck: Very rarely, and it's further complicated by the fact that we have systems in, again, in larger enterprises, large organizations that have been around sometimes for a hundred years. some of the systems are. antiquated, some of the systems are barely running, some of them are robust and lively mainframes that are still very much running the transactions of the world.

Some of them are distributed systems on premise, some are SAS systems in the cloud, some are distributed systems in the cloud, some are homegrown, some are commercial, some have been bought, some have been acquired. There's actually no reason to expect an organization to have a view of this. That's not a rational expectation for a company that's more than 10 years old, honestly.

And I actually, one of my major rants that I like to pull out on podcasts is I think it is not productive to shame our IT team for some of the emergent properties of the I. T. estate and footprint. I think it's not fair. It's not productive. They have been working for decades to try to build something that supports their business and evolves and changes at a pace that almost no other part of the business evolves.

So I think accepting What we have and working with it, but then providing the tools and using the greatest and latest technology in order to mitigate some of those emerging properties is the right approach.

Richie Cotton: Excellent. Yeah. I'm sure the IT people listening to this will be like, Oh yeah, it's good. We shouldn't get the blame for this. and no, cause it is very easy to just be like, okay, well, yeah. The software is not working as it should do. I'm going to blame the IT department.

Lilac Schoenbeck: They should have had it tidied. Really? Really? I don't think so. No, no.

Richie Cotton: alright, suppose, management decides this is a good idea and your CEO makes this big pronouncement about, okay, we're going to become a data driven company. where do you start? Like, what's step one?

Lilac Schoenbeck: I think the first, there's a few pieces that are really helpful right up front, right? I think that map or view of the entire organization's data and entire is an incremental thing, right? Maybe we start initially with the databases and some critical systems and we slowly, slowly add additional visibility.

I think the piece to think about is where the critical. flows of data, the critical transaction points. And so that often requires an understanding of some of those legacy systems, right? The mainframe systems of the other ones that are really at the core of a company, but also extending all the way out to the edge.

Because what we'll also hear is that data flows into your AWS snowflake implementation, right? And so what you want is to be able to broadly span the organization. The second piece harkens back to what we started with earlier, right? The notion of a glossary or a shared understanding of what. Oh, word means um, is actually critical, And I think finally I've never met data people who actually really want to be the possessive insular. Quarters of the data. Most of the time they are delighted to tell you all about it and it's explained to you the nuances of it and actually educate the rest of the organization. They, they typically love this, right?

You don't enter the data field without really loving to talk about data. So I think any such program really should have an evangelical or educational component with from the data team to the rest of the organization in order again overcome that need for precision in places where it might not be necessary, right?

Or some of the fear that perhaps might be pervasive in an organization amongst people who don't usually have these tools, So how do we help people get over the hump and understand what they're looking at and in doing so create some basic dashboards maybe that they can Thanks, everyone. that they feel are approachable, and then, the mechanisms for follow up analysis, if that's what's called for.

Richie Cotton: Okay, I think it's generally true that data people do actually quite like talking about data, for sure. Certainly I like talking about data. That's why I'm here. alright, Once you've, got going on these first steps, what are the sort of the goals you need to set in terms of improving data capabilities?

Lilac Schoenbeck: Yeah, I think that's really an interesting question. I think part of it is cultural, Part of what you're looking for is is the motivation to ask the question, well, on what are we basing this hypothesis or how do we test this hypothesis? How do we test this assumption? Is there data that we presently have in the organization that would help validate which course to take out of these various courses?

And that's a, that is essentially a cultural thing, That's thing that is affected by the leadership as they choose to pivot the way in which decisioning is made. And then I think some of the other pieces, again, there's access, right? If you look 10 years ago, the people who had access to the data dashboards, if they even existed, the Tableaus or whatever of the world, were a limited, limited group.

I think we have to come to a place where we recognize that at least the data that we feel comfortable broadly sharing in the organization is actually shared across the organization. So we need to give people that visibility partly because as you start playing with the tool, you become more facile with it, and you sort of begin to take it on a little bit more as something that's part of your life.

we did this a lot This is a strange analog, but I think it actually applies. When we, in the long ago times, when we rolled out like a Slack or a Teams or an Instant Messenger within a company. Now, I don't know about you, but I've been using some version of Instant Messenger since AIM back in the day, when it was beeping across my screen.

But in corporate America, was not using a ton of Instant Messenger a decade ago. And as we rolled it out, I remember that it was not pervasively rolled out. There was often like this sort of have and have nots of Instant Messenger. And then eventually, like we got to a place where... When something is rolled out, everybody understands the rules of the game and understands how to behave properly in this context and what it means to send a message to the entire channel.

Bob, so maybe stop. And, like, the same thing has to roll out with data, I think you get, give a bunch of more people access and then slowly ratchet up the literacy that is happening within the organization and so that the rules of the game and the rules of engagement become more democratized.

Richie Cotton: I really do like your point about, people just messaging the entire channel on whatever messaging program. Yeah, that's terrible. It's like, the same people are probably still doing like reply all and send to the whole company on email. Yeah. So, yeah. I think, it's also appropriate in the, in the data space, right, because you need to learn to talk about data and that just seems like a, an important part of data democratization. do you have any advice on this? Like, what, what's a good way to communicate around data with your colleagues?

Lilac Schoenbeck: I think that's really, it's interesting because I think, my feeling is that a lot of people don't have the latent training in this, or that there's, there's huge swaths of an organization that might even say that they didn't have a ton of math education, or, like, I didn't take statistics, I was more of a creative type, like, people will say things like that, which I actually think is I would encourage people not to take that approach, right?

Like, it, it's unnecessarily creating a barrier in your mind to something that doesn't have to be unapproachable, right? Most of us are quite capable of going to the store and determining which of the, which is the cheaper item to buy. This is not any more complicated than that, right? It doesn't have to be.

So I think the first thing is to use language that doesn't, Elevate the fear levels across an organization around what this might mean. And then I actually think that your data team is probably your best ambassador for, for education and wanting to elevate the nature of the discourse across a company.

And so to the extent that they're interested in becoming sort of data business partners, maybe in the model like an, you often get an HR business partner or you get a finance business partner. A data business partner is an interesting idea of somebody that might be able to support a group.

And, and provide that sort of. low grade understanding and education as people get their feet wet and move into it. I would be remiss, not to note that one of the big sensitivities around all of this is the fact that not all data actually should run free, Contrary to 1999 Google's manifesto, All data does not actually want to be free. Some data has to be very much bound by compliance requirements and by corporate. security requirements, and, and, and. You don't take an organization of 50, 000 employees and tell them, live and be well. Here's all your data, go crazy, right? Because there's a number of reasons why that would be a bad idea.

And so I think all of this sort of requires us to build the kind of boundaries on the data that keep the organization safe and the individuals safe. and I think that that's a piece that the degree to which that can be transparent, clear, understandable, so that people don't feel like there's danger associated with playing with this stuff.

and so that the organization doesn't incur any risk, right? Because all you need is a, is a compliance violation in a, in a system like this. so having the, the right kind of boundaries in your system that provides the right kind of role based access is absolutely paramount.

Richie Cotton: Okay, yeah, so you probably don't want the HR data being available to absolutely everyone. I, I can see

Lilac Schoenbeck: not.

Richie Cotton: okay, so, if you're setting up a data democratization program, then who needs to be involved in this? are there any particular teams or roles that need to take part?

Lilac Schoenbeck: Yeah, I think, obviously, like a chief data officer or CIO are the obvious, like, leaders of this type of action. But when I think through who else has a role in this, it becomes some of everybody, right? The financial data is often led by the FP& A team. They're the people that are the stewards of that information.

On the other hand, if you thought about pipeline and sales data, right, that's sales ops and marketing that want access to that. What we don't see that often How do I put this? I think every person has a hand in a subset of the organization's data, and every leader probably has. Has some thoughts.

You could even talk about the data associated with the the, the actual work of a business unit, the investments of a, financial services company, or the quality of a piece of software, right? All of that usually feeds into the data footprint of a company. I think when we really think about it, that there's huge value.

in those different organizations understanding each other. And so first we get to the point that everybody is anting up their publicly consumable views of the world, And then the second step, I think, is to engage with the data of your friends and colleagues, right? Because that's actually going to give you a better holistic understanding of the business.

And that's often where we get things. wrong or where they stagnate, right, is that it's easy for finance to focus on finance data and marketing to focus on marketing data. But if we take a big step back, we realize that them coming, those pieces coming together actually is the strategy of the business.

Richie Cotton: That seems interesting because you're always going to have an adjacent team, like maybe the team that you work with a little bit and understanding their data as well as your own, at least casually does seem like, a good way to sort of gel. And I guess, again, it's coming back to not having fights with your colleagues as often.

Lilac Schoenbeck: Yeah, and empathy for what they're dealing with or their challenges, right? Because it's so easy. One of the places where we make a lot of not data driven assumptions are around what the other team could or should do, And so if you look at their, well, they should be, and you're like, well, okay, but they're starting.

at a much different position than perhaps you're giving them credit for, right? And so having an access to the baseline and the data that they're, that they're working with and, it provides a better basis for negotiation and understanding.

Richie Cotton: so I'd like to talk a little bit about tooling. and so improving access to data, part of the story is going to be about having the right tools for this. can you talk about like, what are the sort of, the components of a data stack that are going to assist with democratizing access to data?

Lilac Schoenbeck: Yeah, I think that's, that's an interesting question. So I think there's elements. So we talked a little bit about the element of lineage and understanding the lineage between all the systems and how data flows. We also talked a little bit about a data catalog. And a glossary, which sort of work hand in hand to make sure that we know what we've got in there and which are the pieces that matter and which are the ones that are right.

There's a data quality component to that, where the backend system of the bank that keeps track of your account is probably the better source for your address. than whatever you entered into the web form when you were sitting at a bar trying to get a coupon, and so these, distinctions, right, and being able to guide people as they move through the data.

There's obviously the front end interface to the whole thing with a BI tool, whichever one is your preference. There are so many these days on the market. I think all of these things are critical components. and differingly critical at different points in your evolution. The thing that I'm not saying and intentionally not saying is that I don't think you have to wait until some sort of idyllic platonic world where you have got all of your data in one streamlined, beautiful data lake and everything is tidy and then we can start doing this.

Right? I think that's perhaps just a fruitless approach, right? And so while I appreciate the And, and I've seen tremendous results from people working with data lakes in order to get just a lot of flexibility in their data environment. the notion that that has to be done and dusted as step one fully before we move on to step two, I think is a, is a mistake, I think the best way is to actually accept the, the systems that you have. Accept who you are, IT organization, and work with that and find the connectors, find the scanners, find the pieces, find the even like the change data capture tools and the other pieces that help you bring pieces together, incrementally over time, the companies that I've seen be successful with this are not actually trying to overhaul the world on day one, because it's a fruitless effort, new systems are going to come in every three months and six months.

And we'll be chasing our tails the whole time. Like, Let's build a practice of incorporating data into our view of the world and have that continue to expand incrementally. And if you do it that way, you also get the buy in of the business, the people sort of clamoring for more. And when people clamor for more, you typically get the budget and typically get the focus, right?

And that, creating that demand within your organization is going to help move the ball forward.

Richie Cotton: That's interesting. And, the idea that you need to have some sort of little success story in order to drive, excitement and, make more things happen, that seems really important. do you have any ideas for what can give you a quick win here? Can you make that a bit more concrete?

Lilac Schoenbeck: I think I probably have a a non technical answer for that. I'm going to give you a non technical answer for that. I think a quick win is actually on the cultural side. I think a quick win is for senior leaders within the organization to commit to leveraging data more heavily in their communications, And when we see, and I've seen this repeatedly in organizations, when we see data being presented in an all hands meeting or a town hall meeting, it is universally adored. And when we can take the time to explain that so that every individual in the organization knows what Revenue means or bookings means or right, all of these things that might seem commonplace to those of us who deal in these pieces of data, but maybe a little more esoteric to those of us who don't, it is universally loved and the degree to which we can apply it.

increase the clip of that, the types of data, the regularity of that, that's actually going to change the entire culture of the organization so that people want more. One of the things we always talk about, right, with data is that the minute you're presented with a bunch of it, you have five follow up questions, That's actually what's going to propel you forward, is those five follow up questions. When people say, well, that's interesting, but how does it vary by region? Now you've got them. That's what we need to catalyze. and I think it is actually incumbent on the leadership of an organization to make that happen.

Richie Cotton: I do like the idea that, it's question driven development and it's like, well, these are the things that your actual end users are asking about the data and that's going to give you some, inspiration in terms of like what you need to do next in terms of your data, processes and stack.

Excellent. so I'd like to talk a little bit about data silo because it seems like this is like maybe one of the biggest, almost common problems that organizations face with respect to data. how do you, overcome this, idea that, some people in the organization have access to some data sets, but not everyone does and things aren't joined together.

Lilac Schoenbeck: Yeah, I think that's interesting because sometimes, as we said, it's necessary, right? Your HR data is not going to be democratized outside of HR anytime soon. And that is good and right. And so there's some, I think the word silo has a slightly negative connotation because it does feel a little obstructionist, right?

But let's admit that there are places where the silo is actually good and right. Um, There's other places where the word silo sort of is meant to... Perhaps imply a little bit of protectionism or hoarding or gatekeeping by the people who have that information. The thing about it is, think long term that approach does not win within an organization, Because then what you're stuck with is a situation where people either have to come to you every single time. That becomes tedious, and then your data becomes less relevant, right? If you were a gatekeeper of that, then eventually people are going to say, I'm not going to work with Janice. Like, it's not worth it.

I'm going to find a way to get my answer by going around that. And in doing so, you actually starve yourself out. Right? You starve yourself out of the conversation. That's not a productive stance. On the other hand, if you do engage and you do break down that silo and give willingly of your data, then people start saying, well, Janice has all the answers.

Why don't you just ask Janice? She's got all this, and What I have found, again, is I think it's an old school philosophy to build a silo, an unnecessary silo, it's a, protect my job, old school philosophy. And increasingly, right, we're in a work, in work environments where people move around regularly, like, I just think the culture of the whole thing is far, far less.

I will build my dominion and no one will enter my castle walls. That's not how we operate as humans and companies anymore because your dominion is probably fickle anyway, but it'll come, it'll go. And so I think what we're seeing, is a generational shift of more collaboration in part because in that collaboration, you actually gain power, right?

And if that's ultimately where you're going with your style, right? That's going to be helpful. And so the people that are most collaborative on the data front, right? are the ones that are turned to most, that are valued most, that are part of conversations that they wouldn't otherwise perhaps be part of.

That feels like a reinforcing cycle.

Richie Cotton: That's really interesting. I haven't really heard the idea that collaboration is a good career move in some sense, with respect to data. but actually, yeah, if you're talking about data to other people, you're going to do some networking, get involved in those great conversations and it probably is a good career move.

Excellent. all right. I'd like to talk a bit about skills. and you mentioned that maybe one of the things, you can do to, in terms of tooling to improve the data situation, you get your new, business intelligence tool, but then often, if. There's no point in just buying this business intelligence tool unless people know how to use it.

And often people don't feel confident, working with data or creating reports or whatever. so what can you do to make sure that these new tools are adopted and they're being used effectively?

Lilac Schoenbeck: Yeah, that's a good question, right? And I think always seeding the initial dashboards, always creating a set of a view of the world, That people don't have to go in and create for themselves from scratch. There's a the sort of analogy of like when you buy the Lego set, right? It comes with the instructions to build the Death Star or whatever.

Now, any Go to Beadaholique. Lego Master, if you've seen them on TV, right, can build anything. They can build the Earth, they can build Saturn, they can build whatever, right? Me, I'd be lucky to build a Death Star, and I'm probably just going to build a small hut, right? That's probably as far as I'm going to get.

But having the instructions and having the kit makes it much, much more approachable, right? As you're with this bag of tiny bricks. And that's very much the way that it works with data, right? So what we're seeing is that organizations put up a set of very purpose built dashboards, often for their...

constituent customer base within the company. And then, as I said, like the questions spawn the greater granularity, like, and sometimes that's visualization. I prefer to see this as a as a stacked bar chart, and then there's sometimes there's follow up questions. Can we cut the data by, I don't know, region or by or by industry or whatever.

It's those follow up questions and it's that challenge that is actually what gets you to engage. But you do have to start with something, a big box of weird shaped bricks. Often people can't actually imagine what they could build with that.

Richie Cotton: yeah, it's amazing how like the small children get the idea of Lego. They're both thinking from their imagination. Grown ups are, are a bit less good at that. Uh, and so related to the imagination problem, there's also like a more general confidence problem. And Do you have any tips to make data seem less scary and a bit more approachable?

Lilac Schoenbeck: Everybody puts them in pie charts, and I have seen so many completely borked up pie charts in my life, I can't even talk about it. I don't believe the pie chart is more approachable than the bar graph. I'm just going to start with that. I think there's an element Again, I think the average data scientist is a delightful human that actually doesn't want to make this impenetrable, right?

They don't actually want to talk in terms of statistical language. They actually want to talk about their findings and what they've learned. So firstly, I would just say that I don't, I believe that this is sort of some latent STEM PTSD that we have in the culture that actually has nothing to do with the individuals in the field today.

The other piece that I would just say is less is more, particularly as we're starting, I also, I have, I have data scientists I know and love. And my God, would they love to give me a spreadsheet that barely loads on my laptop? And that is lovely. But the truth is, is that I consume, as not a data scientist, right?

consuming something that is a little bit more streamlined and a little bit more focused is easier for me to sort of read what's going on. I think that's true. for anybody entering this domain, right? And so I think from the perspective of the data team, how do I present a limited set of actionable pieces of data and then let people dig deeper as they want to?

The inclination is to be like, look at all this great stuff I found. And I, I get that, right? But as people are stepping into something, how do we make it? not feel overwhelming in its sheer mass as well.

Richie Cotton: Yeah, there's definitely something to be said for appendices and burying the detail a bit later on in an optional manner. okay. So are there any, Data skills that think ought to be universal. Like what does everyone need to know about data?

Lilac Schoenbeck: Oh, gosh. I think there's basic math skills that we wish were a little bit more universal, right? Because there's, there's so many times when you're thinking to yourself that's not how you divide fractions, right? and, and it triggers those of us who, who, who do have that background. I think that the primary piece is just a little bit around language, what does it mean? To be a percent, what does it mean to understand the denominator? feel like I say that like a hundred times a month. Like, what's the denominator on that number? Because if I don't understand the words next to the number, I don't actually understand the number, and that happens over and over again, where somebody will say it's like 83 percent of what? And, and so I think being, having the tools actually to ask the word questions around the numbers is probably more important than being able to math the numbers yourself. You've got an excellent team and excellent software that helps with doing the math and doing the numbers.

And I think that's what we think when we think of data literacy, we think I should be able to do all the math. I don't think you actually need to be able to do all the math first. I think you need to be able to understand the words around the numbers and what they actually mean. first and then work with somebody or work with a piece of software to get at the math.

Richie Cotton: That's interesting. It's like natural language, data science. I like it. okay. what sort of training do, people who don't have data in their job title need around data?

Lilac Schoenbeck: I think, sometimes there's online, it's been so long since I've taken an online course, I won't pretend that I sort of, sort of knowing what, what these are, but I think that whatever makes you feel more comfortable understanding what's being placed in front of you is the right. thing, Like understanding basic maths, understanding how to read bar charts, understanding how to understand how to understand the elements of data, revenue, bookings, customers, churn, whatever pieces are relevant to your business, And that could be understanding the stocks and bonds, like whatever, whatever it happens to be. The piece for me is to lower the fear level. Lower the impenetrability level and whatever it takes. And for some of us, that's, we're delighted to take an online course and learn. For others of us, reading is the right, is the right approach.

For others of us, it's a conversation, right, with somebody who might be an expert. And what I find largely in organizations and among people is that if you turn to them with that little bit of vulnerability that says, hey, I'd really like to better understand bond math, they'll probably sit down with you and give you a basic primer.

And also, you've built a relationship. You've built a place to ask the follow up questions. I think that actually has such value. So that you're not feeling like you're alone in, in these corners of, frankly, often esoteric and very industry specific and very function specific pieces of data.

Richie Cotton: Yeah, I think that's kind of key that, Although maybe people like are worried that they're going to say something stupid around data, no one's going to laugh at you because you don't understand some data concept. I'm just going to explain it to you in general, unless you. Speak to someone very horrible, but that

Lilac Schoenbeck: There's no

Richie Cotton: very much

Lilac Schoenbeck: in data, are there?

Richie Cotton: think it's a rarity. Um, all right, so are.

there any, um, specific requirements around like data skills? You think that executives, um, have like, what did senior management need to know about data?

Lilac Schoenbeck: I mean, I would like them to be very facile with understanding the data. I don't know that that's a universal truth. I think we're, we're coming to that, right? And I think it probably, industry probably matters, and function probably matters. The degree to which that individual had to be, had to learn these skills during their experiences getting to that.

Getting to that role in that position when we finally look at, CEOs of public companies, they have to be facile with the stuff just because that's the sort of job requirement, but along the way, there's a lot of variability. I think it's evolving, though, right? I think in 20 years, you're going to see a different kind of facility across the board.

Richie Cotton: Okay. so it's good news. You think that, in general, the, the skill levels are getting better across management. you think that there's more understanding, at that level.

Lilac Schoenbeck: I think there is, and I think it will continue to be. That's my experience is that there's, it's more than there was 10 years ago, and it'll continue to grow.

Richie Cotton: Excellent. that's very good news. I think. okay. you mentioned, data culture a few times earlier. I'd like to dig into that a bit more. what do you think constitutes a good data culture? How would you describe it?

Lilac Schoenbeck: I think open, collaborative. I think those are the big pieces. It doesn't, like I said, there's data that has to stay confined. But I think a posture of openness, collaboration respect, I think that one of the biggest barriers to collaboration in organizations, particularly when one group has a set of very specialized and unique skills is that there, You can build a culture of innate superiority that is really detrimental to the entire business.

Understanding that the people that you're working with on the other side of the wall have their own specialized skills in a domain that perhaps you don't, And that, have that kind of trust and respect between organizations is gonna be the most important thing. I also think that there's an element worth considering as you structure a data team of expediency.

Because a lot of times when senior leaders are looking for additional data, they don't, they don't really feel they have three weeks for it, right? They actually would, if they could, if they could get the answer today, that's great. If they can get it this week, okay, Understanding and if they understand why they can't, that's also important, But understanding that a data request doesn't go into a queue and at some point, we'll call your number. That oftentimes for data to be relevant, it has to be available. It has to be fairly responsive. Because the business decisions that happen don't typically wait weeks and months, actually.

Like, we feel like we talk about things forever, but it's amazing how quickly we pivot and put a pin in something and say, this is truth. So having a very responsive data team that understands that that isn't a fickle. choice on the part of leadership, but rather that emergent question in that meeting that says, Well, actually, how many of our customers have X?

And if you could answer that for me today, I can make a different choice tomorrow. And I really am going to make that choice by Thursday.

Richie Cotton: Yeah, I think the lack between asking questions in an organization and having them answered is such a huge issue. And, ideally you want the answer immediately, but, that's a tricky place to get to sometimes. Are there any just simple changes or fixes you think an organization could make in terms of being more consistent or more effective in data driven decision making?

Lilac Schoenbeck: I think real clarity around how you communicate data so that everybody knows exactly what's being talked about. The words again around the data are really important. And also real clarity when you when you don't have data. One of the things we don't label is the absence of data very well. And it's It's okay not to have all the data, it's actually fine.

What's not okay is pretending it exists when it doesn't, Or pretending it is unknowable when it is knowable, right? there is a real intellectual honesty to saying, we would love to know what industry our customers are in, we don't have that data right now. And this is or is not what we're going to invest in.

fixing that, right? You get to choose as a company where you put your investment and where you put where you decide to fix things in the data footprint. I have seen too many times when people have just glossed over a gap or lamented a gap when nothing was actually going to be done to fix that.

And that's okay, right? Not everything will be fixed. it removes some of the weird shame and weight upon all the people in the system, right? Right. We recognize we will never know what industry our customers are in because of a series of foibles or the nature of our CRM or the phase of the moon.

And that's okay for us right now. We're going to move forward without it and make decisions without that information. And that kind of clarity and transparency, I think actually lowers the temperature of these sorts of questions and the fear associated with engaging with the data.

Richie Cotton: That's interesting. So like at the start we were talking about how you need to know what data you have and how you're using it. So actually it's equally important to know what data you don't have and you know sometimes be okay with, we're not going to get this. interesting. Okay, And are there any processes, you can put in place or changes to processes you can make in order to encourage good data usage?

Lilac Schoenbeck: I mean, I think just like we, like I said, I think I feel like I'm being a little repetitive, but I think labeling, clarity, right, let's not let any piece of communication go out without, what does this actually mean? Right? And I've actually seen, the marketing slides that say 85 percent of, of, of what, right?

And, having a rigor around the clarity of the data that is presented, I think is, absolutely one of the biggest pieces here. Right. Because it, to our first point, right, when we started at the start, like, the debate about what truth is, is really hampered by the idea that we actually don't even know what we're talking about.

And so having that clarity consistently within an organization and not allowing that to go out in the same way that every PowerPoint slide says confidential on the bottom, right, even if it's not. Um, I think that that, there's a, that is a rigorous practice within an organization that will help just check and balance.

on the whole thing really

Richie Cotton: Excellent. Definitely have seen that, confidential PowerPoint slide, thinking far too often. It's like, yeah, really? I'm not sure that's, that's very confidential. Um,

Lilac Schoenbeck: Our public ad campaign. I feel like not.

Richie Cotton: All right. so before we wrap up, is there anything, exciting that you're working on at Rocket?

Lilac Schoenbeck: So Rocket, we, we work very closely on the working with data and with content. And in particular, we have a data intelligence solution that helps with some of the problems we've described today. The problem of understanding where data lives within your organization and how it travels throughout your organization.

And I think it's just such an interesting technology to have built because what it requires is two very disparate pieces. It requires a really strong understanding of the back end systems, a lot of different applications, homegrown applications, languages even, right? Because the homegrown ones don't have an API necessarily.

So you've got to actually be able to understand how data moves within an application from a mainframe to a cloud system. That is a, Highly technical and esoteric topic, but then at the same time, everything else we've talked about data democratization. I don't want to present that in a way that's hairy and unpleasant.

So what I need to do is present a view of lineage and a view of the data in your organization that's actually accessible to people beyond the most technical I. T. team that would really appreciate the exact part of the database where you pulled that feel And so being able to span that deep, deep backend knowledge with a very easy to use tool that allows you to fly through the data in your org, like you'd fly through a Google map is something that we just recently released, we've had the backend for a long time but we recognize that need for data democratization and that need for orgs.

non technical users to be able to engage with the flow of data in their organization. And I'm just super excited about it at the piece. Honestly, that's that's wonderful for me is that I could use it, right? I'm not a data scientist. I wouldn't pretend to know where in the database I really want to find this field.

I can use this and say, Oh, I see the customer number gets transferred over here and then it gets obscured and then it comes out the other side. And if I want to change something, these are the places where I'd have to fix. and patch. That feels really empowering to me and really exciting.

Richie Cotton: All right, so this really comes back to the idea of understanding what data you have and where it is and how it relates to all your other bits of data. That just seemed incredibly useful. all right, just finally, do you have any, last advice for organizations wanting to improve their data usage?

Lilac Schoenbeck: I think, start small and try to set aside the fear. I don't think it's that scary, I think little, little bits of willing, engaged pockets of applications and people, Start with where the enthusiasm is and where the demand is. And incrementally grow from there. I think that's the best approach.

Because like you said, you get, you get some early wins. You get some demand. You get some enthusiasm. and from there, that can really motivate an organization.

Richie Cotton: All right, yeah, a few steps in the right direction is going to get you a long way. Excellent. All right, thank you very much for your time, Lilac.

Lilac Schoenbeck: Thank you so much, Richie. It's been fun.

Topics
Related

blog

How Data Discovery Tools Enable Data Democratization

Understand how top-performing companies solve data discovery challenges to drive data fluency at scale.

Kevin Babitz

7 min

blog

What Does Democratizing Data Mean? Unlocking the Power of Data Cultures

Learn about data democratization, why it matters, and how to achieve it. Explore how it can improve data literacy, empower individuals and businesses, and create a positive social impact.

Matt Crabtree

13 min

podcast

Democratizing Data in Large Enterprises

Meenal Iyer shares her thorough, effective, and clear strategy for democratizing data successfully and how that helps create a successful data culture in large enterprises

Adel Nehme's photo

Adel Nehme

43 min

podcast

[Radar Recap] From Data Governance to Data Discoverability: Building Trust in Data Within Your Organization with Esther Munyi, Amy Grace, Stefaan Verhulst and Malarvizhi Veerappan

Esther Munyi, Amy Grace, Stefaan Verhulst and Malarvizhi Veerappan focus on strategies for improving data quality, fostering a culture of trust around data, and balancing robust governance with the need for accessible, high-quality data.
Richie Cotton's photo

Richie Cotton

39 min

podcast

Building the Case for Data Literacy

Valerie Logan shares insights on what a successful data literacy journey looks like.
Adel Nehme's photo

Adel Nehme

38 min

podcast

Scaling Enterprise Analytics with Libby Duane Adams, Chief Advocacy Officer and Co-Founder of Alteryx

RIchie and Libby explore the differences between analytics and business intelligence, generative AI and its implications in analytics, the role of data quality and governance, Alteryx’s AI platform, data skills as a workplace necessity, and more. 
Richie Cotton's photo

Richie Cotton

43 min

See MoreSee More