Apply Conf 2022
5. DIY minimal feature store

5. DIY minimal feature store, João Santiago, Billie

https://www.youtube.com/watch?v=q4bZ0ixdUKk&ab_channel=Tecton (opens in a new tab)

  • Requirements
    • Easily maintainable
    • Same feature implementation during training and inference
    • Use past data in real-time models
    • Implement a feature once

Screen Shot 2022-05-23 at 14.58.22.png

  • Beamter drawbacks

    • Cool but no real-time data caching
    • Sync versions between project
  • Use Snowflake streams and tasks, define features as SQL function

    • No kafka, no spark, no new things to learn

    Screen Shot 2022-05-23 at 15.01.16.png

    • Fivetran delays here are a bottleneck, will be replaced with some events using Kafka

Screen Shot 2022-05-23 at 15.03.12.png

  • In summary, you can build features store with 3 components: Snowflake, Lambdas and Redis

Going further in the discussion: unlocking our data with a feature store (opens in a new tab)