Do you think it makes more sense to apply semantics when data is generated, or further down the data/analytics pipeline?

At data generation: 44%

Further down the pipeline: 52%

I don't know, but curious what others think: 5%

131 PARTICIPANTS
1.4k views, 1 upvote, 1 comment
Executive Director of Technology in Healthcare and Biotech, 2 years ago

I think the approach depends on the nature of the data, such as whether it is structured or unstructured, and on the volume of data. In the healthcare domain where I work, I think semantic redefinition is most effective later in the pipeline. I would be concerned about performance if it were done upfront, especially given the terabytes of data we receive daily, much of which is non-discrete and unstructured. I also appreciate the flexibility to modify the semantic model later in the pipeline without needing to regenerate data.
