Are data lakes a good way to manage big data?
It depends on the size, source, and format of the data. Data lakes are great for large volumes of unstructured data coming from multiple sources in various formats. A lake can ingest data quickly and scale with the rate at which the data is generated. If that describes your use case, a data lake will be a good option. The important caveat is that a data lake only aggregates large amounts of data from multiple sources into a cost-effective repository; it provides no intelligence from the data natively, so you need to apply data science to get something useful out of it. A rough sketch of the "ingest raw, analyze later" pattern follows below.
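For illustration only: this minimal sketch shows what raw ingestion can look like, using a local directory as a stand-in for an object store; the file names, source labels, and partition layout are assumptions, not a prescription.

```python
# Minimal "landing zone" sketch: land files as-is (schema-on-read),
# partitioned by source system and ingestion date so they can be
# found and reprocessed later. Paths and names are illustrative.
import shutil
from datetime import date
from pathlib import Path

LAKE_ROOT = Path("datalake/raw")  # local stand-in for e.g. s3://my-lake/raw

def ingest(file_path: str, source: str) -> Path:
    """Copy a file into the raw zone without parsing or transforming it."""
    dest_dir = LAKE_ROOT / f"source={source}" / f"ingest_date={date.today().isoformat()}"
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / Path(file_path).name
    shutil.copy2(file_path, dest)  # raw bytes untouched; schema is applied on read
    return dest

# Usage (hypothetical files):
#   ingest("clickstream-2024-05-01.json", source="web")
#   ingest("orders_export.csv", source="erp")
```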
Data lakes are also critical for XDR and SIEM tooling, which needs to correlate and analyze both current and historical event data.
The data lake is just the beginning of modern data management. What matters is how you curate your lake so that it is fit for analysis. Every modern data management initiative needs to serve three groups of users: data analysts, data scientists, and business users. The objective is to make the data usable and available, with proper segmentation and security (see the sketch below).
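As a rough illustration of what "curating" can mean, one common step is promoting raw files into a typed, partitioned, access-controlled zone that analysts and BI tools query directly; the column names, partition key, and dropped field below are assumptions for the example.

```python
# Curation sketch: raw CSV -> typed, partitioned Parquet in a "curated" zone.
import pandas as pd

RAW = "datalake/raw/source=erp/ingest_date=2024-05-01/orders_export.csv"
CURATED = "datalake/curated/orders"  # analysts and BI tools point here

df = pd.read_csv(RAW, parse_dates=["order_date"])
df["order_month"] = df["order_date"].dt.to_period("M").astype(str)

# Segmentation/security: strip fields business users should not see.
df = df.drop(columns=["customer_email"], errors="ignore")

# Typed, columnar, partitioned output: cheap to scan and easier to govern.
df.to_parquet(CURATED, partition_cols=["order_month"], index=False)
```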
Data lakes can be very beneficial for organizations that need to increase operational efficiency and innovation, but without oversight or a clear purpose for the contents they can drive up compute costs, add complexity, and erode data integrity.
Agreed. You need a viable and flexible architecture, and a plan for short-, mid-, and long-term operations and growth, that accounts for the various data types, categorization, and cross-functional operations across the collection, persistence, access, and retention process.
A principle of innovation is to avoid the next logical step; that is incrementalism. I think we should avoid data lakes and other designs that create yet another copy of data and another way to spend money on storage. We should instead leverage AI and ML to use data in place, moving only the data that is needed, where and when it is needed. Consider all the data at all sites a global data environment that you can use and analyze without making copies. The sketch below shows one way to query data where it already lives.
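For example, engines such as DuckDB, Trino, or Athena can query files where they already sit; in this sketch the bucket path and column name are hypothetical, and nothing is copied into a separate store.

```python
# Sketch of analyzing data in place instead of copying it into another repository.
import duckdb

con = duckdb.connect()           # in-memory; nothing is loaded until queried
con.execute("INSTALL httpfs")    # extension that lets DuckDB read s3:// paths
con.execute("LOAD httpfs")

# Aggregate directly over Parquet files in an existing application bucket
# (path and 'region' column are made up for illustration).
result = con.execute("""
    SELECT region, count(*) AS events
    FROM read_parquet('s3://existing-app-bucket/events/*/*.parquet')
    GROUP BY region
""").fetchdf()
print(result)
```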