Our agenda is packed with presentations, arranged into 6 categories – find your most desired topics!

You can choose whether you prefer to WATCH THE CONFERENCE ONLINE or JOIN US IN PERSON IN WARSAW. More presentations, more experts and more topics!

28.03.2023 - WORKSHOP DAY

8.30 - 9.00

8.30 - 9.00
Registration of participants at the hotel

9.00 - 16.00

9.00 - 16.00

PARALLEL WORKSHOPS (independent workshops, paid entry)

ONSITE
Warsaw Marriott Hotel, 2nd floor

All three independent workshops (paid entry) will take place the day before the conference, on March 28, onsite on the 2nd floor in the Warsaw Marriott Hotel in rooms: Wawel & Syrena, Ballroom E, Ballroom F. The conference rooms will be appropriately marked so that each workshop participant can easily find the selected speech. More about the location of the workshops HERE.

DESCRIPTION:

SESSION LEADER:

Data Analyst / Analytics Engineer
GetInData | Part of Xebia

Data Engineer
GetInData | Part of Xebia

DESCRIPTION:

SESSION LEADERS:

Machine Learning Engineer
GetInData | Part of Xebia
Senior MLOps Engineer
Printify

DESCRIPTION:

SESSION LEADER:

Data Engineer
GetInData | Part of Xebia

DESCRIPTION:

SESSION LEADER:

Data Analyst / Analytics Engineer
GetInData | Part of Xebia

Data Engineer
GetInData | Part of Xebia

DESCRIPTION:

SESSION LEADERS:

Machine Learning Engineer
GetInData | Part of Xebia
Senior MLOps Engineer
Printify

DESCRIPTION:

SESSION LEADER:

Data Engineer
GetInData | Part of Xebia

19.00 - 22.00

19.00 - 22.00
EVENING SPEAKERS MEETING (Only for Speakers)
ONSITE
Floor No2 restaurant, Marriott Hotel

Evening Meeting for Speakers. Let's meet! To talk, to meet new people, to exchange experience. We invite you for a face 2 face interaction onsite. The integration meeting will take place at the Floor No2 restaurant at the Marriott Hotel in the center of Warsaw. The event starts at 19:00.

29.03.2023 - 1ST CONFERENCE DAY | HYBRID: ONLINE + ONSITE

 

8.00 - 9.00

8.00 - 9.00

Registration of participants at the hotel; Morning cofee & breakfast and networking time

ONSITE
Warsaw Marriott Hotel, 2nd floor

9.00 - 9.15

9.00 - 11.05
Plenary session
9.00 - 9.15 | 15min
Conference opening
CEO & Meeting Designer
Evention
CEO and Co-founder
GetInData | Part of Xebia

9.15 - 9.40

Plenary session
9.15 - 9.40 | 25min
Cloud-based MLOps – story from banking sector

The banking sector continuously adopts new and more advanced analytical models. As the number of models grows it is more and more difficult to maintain the required time-to-market for putting the models into production. We will show how we are navigating through our cloud-based MLOps journey. It is sometimes difficult but in the end inevitable.

Director of SME Risk & Analytics
PKO Bank Polski

9.45 - 10.15

Plenary session
TechPoint Panel
9.45 - 10.15 | 30min
Advances in Big Data Processing: Exploring the latest and evolving tools and solutions

This panel will bring together leading experts from chosen vendors of big data solutions. They will share a deeper understanding of the latest trends, technologies and methodologies driving the big data industry, and leave with practical insights they can apply in their organizations. We will ask series of deep tech questions to all panelist.

Moderator:
Chief Data Architect
GetInData | Part of Xebia
Panel participants:
Snowflake Data Superhero, Co-founder and CxO at dataconsulting.pl
Snowflake
Customer Engineer (Smart Analytics & ML)
Google Cloud
Director of Product Management - Data in Motion
Cloudera

10.15 - 10.40

Plenary session
10.15 - 10.40 | 25min
Terraforming hardened Azure Databricks environments

#databricks #terraform #devops #security #azure

Creating Azure Databricks environment is as simple as “click of button” but how to ensure platform is secured and protected from data exfiltration? How can Infrastructure as Code and Terraform support platform hardening? How can DevOps accelerate your Bigdata projects? How BigData engineers can benefit from automated and secured platform? What are key configurations options to consider? What are pitfalls and limitations? What could be improved in 2023?

Associate IT Architect Director
Iqvia

10.40 - 11.05

Plenary session
10.40 - 11.05 | 25min
Intrum's Data and Analytics Journey and Future Vision

#data, #analytics, #dataarchitecture, #Vertica
Intrum is Europe’s undisputed market leading credit management company. We have been utilizing Big Data for years. It is a constant work in progress with challenges, improvements, and adjustments. We are bringing together data from all countries to enable data products on all levels. The presentation will go through our journey so far as well as what we will be looking at as we move forward to make sure that we are able to handle the ever-growing thirst for data. We will look at why we added Vertica to our data landscape on top of Hadoop, what Vertica capabilities we are leveraging today for optimal performance and management of data for 24 countries and what changes will be there as we look towards Cloud, for example, the use of delta format for raw data and Databricks for some of the processing as well as switch to Vertica EON on Kubernetes.

Data Architect
Intrum Global Technologies

11.05 - 11.35

11.05 - 11.35

BREAK

11.35 - 13.15

11.35 - 13.15
PARALLEL SESSIONS

Host

Senior Marketing Specialist, GetInData
Samantha System, Clueless Computing

Host

Data Engineer
GetInData | Part of Xebia

Host

Data Science Practice Lead
GetInData | Part of Xebia

Host

Development Director
Evention
Host of parallel session 1
Senior Marketing Specialist, GetInData
Samantha System, Clueless Computing
Host of parallel session 2
Data Engineer
GetInData | Part of Xebia
Host of parallel session 3
Data Science Practice Lead
GetInData | Part of Xebia
Host of parallel session 4
Development Director
Evention

11.35 - 12.05

Parallel session 1
Data Engineering
11.35 - 12.05 | 30min
Data Engineering - the most eclectic part of Machine Learning process.
Field Consultant
Ab Initio
Parallel session 2
Architecture, Operations and Cloud
11.35 - 12.05 | 30min
Volvo – Data Journey towards Cloud – Large scale data platform implementation
Manager of Cloud Monitoring Team, Data & AI Foundation
Volvo Group Digital & IT
Manager Design Authority & Platform, Data & AI Foundation
Volvo Group Digital & IT
Parallel session 3
Real-time streaming
11.35 - 12.05 | 30min
Control your data distribution like a pro with cloud-native NiFi architecture!
Director of Product Management - Data in Motion
Cloudera
Parallel session 4
AI, ML and Data Science
11.35 - 12.05 | 30min
How to start with Azure OpenAI?
AI Cloud Solution Architect
Microsoft

12.10 - 12.40

Parallel session 1
AI, ML and Data Science
12.10 - 12.40 | 30min
Machine learning techniques to make IT audit efficient
FCCA, CIA, CertDA, Manager of Technology and Operations IT Audit Team
ING Hubs Poland
Parallel session 2
Data Engineering, Data Strategy and ROI
12.10 - 12.40 | 30min
Data Mesh! Just a catchy concept or more than that? - Our Journey at Roche on building an enterprise level Data Mesh Platform.
IT Expert
Roche Informatics
Platform Architect
Roche Informatics
Parallel session 3
Architecture, Operations & Cloud, Data Strategy and ROI
12.10 - 12.40 | 30min
If data platforms are dead, then what next?
AWS Cloud Solutions Architect
BlueSoft
Head of Data Services
BlueSoft
Parallel session 4
AI, ML and Data Science, Architecture and Operations
12.10 - 12.40 | 30min
The Ultimate DataHub for all your BigData needs
SE Team Lead / PSE, CEE
Pure Storage

12.45 - 13.15

Parallel session 1
DATA ENGINEERING
12.45 - 13.15 | 30min
The Art of Dataset Design: How to Build Tables That Support Any Analysis
Staff Data Scientist
ex-Kry, ex-Spotify
Parallel session 2
Architecture, Operations and Cloud, Data Engineering
12:45 - 13:15 | 30min
Smart manufacturing in life science. Real-life use case of cloud based project in IoT
Parallel session 3
AI, ML and Data Science
12.45 - 13.15 | 30min
Living in Perfect Harmony - Where Music and Machine Learning Meet
Data Sciencist
Meta
Parallel session 4
Streaming and Real-Time Analytics
12.45 - 13.15 | 30min
Apache Flink: Introduction and new Features
Staff Software Engineer
Decodable

13.15 - 14.00

13.15 - 14.00

LUNCH BREAK

14.00 - 15.40

14.00 - 15.40
PARALLEL SESSIONS

14.00 - 14.30

Parallel session 1
Architecture, Operations and Cloud
14.00 - 14.30 | 30min
Ask Questions First, How We Flipped The Data Lake On It's Head
Head of R&D, Data Engineering Infrastructure
Wix.com
Parallel session 2
Data Engineering, Data Strategy and ROI
14.00 - 14.30 | 30min
What data engineering approach and culture fits your organisation better
Engineering Director
PepsiCo
Parallel session 3
AI, ML and Data Science
14.00 - 14.30 | 30min
Lean Recommendation Systems - The Road from “Hacky” PoCs to “Fancy” NNs
Senior Data Scientist
FREE NOW
Parallel session 4
Streaming and Real-Time Analytics
14.00 - 14.30 | 30min
Applying consumer-driven contract testing to safely compose data products in a data mesh
Professor of software engineering. Technology Consultant
HTW Berlin; Thoughtworks

14.35 - 15.05

Parallel session 1
MLOps
14.35 - 15.05 | 30min
Data Science meets engineering - the story of the MLOps platform that makes you productive, everywhere!
Machine Learning Engineer
GetInData | Part of Xebia
Senior MLOps Engineer
Printify
Parallel session 2
Data Engineering
14.35 - 15.05 | 30min
How to build reliable, observable and maintainable Data Pipelines based on Datavault built to last and serve BI and ML use cases alike
CEO
Alligator Company
Parallel session 3
AI, ML and Data Science
14.35 - 15.05 | 30min
Quality Over Quantity - Active Learning Behind the Scenes
Head of Research
Healthy.io
Parallel sesion 4
Architecture, Operations and Cloud, Streaming and Real-Time Analytics
14.35 - 15.05 | 30min
Near-Real-Time streaming applications monitoring and automated maintenance at billions of records/day scale: the architecture allowing us to sleep at night
Senior Data Engineer
Agile Lab

15.10 - 15.40

Parallel session 1
Architecture, Operations and Cloud
15.10 - 15.40 | 30min
On-prem to the cloud: challenges, common pitfalls, and lessons learned!
Principal - Data & Information Architect
GE Healthcare
Staff Data & Information Architect
GE Healthcare
Parallel session 2
Data Engineering
15.10 - 15.40 | 30min
Modern Big Data world challenge: how to not get lost among multiple data processing technologies?
Senior Software Engineer
Allegro.pl
Software Engineer
Allegro.pl
Parallel session 3
AI, ML and Data Science
15.10 - 15.40 | 30min
Common issues with Time Series and how to solve them
(Lead) Data Science Consultant firma: GoDataDriven
Xebia Data (Netherlands)
Parallel session 4
Data Strategy and ROI
15.10 - 15.40 | 30min
How to build a data community: The fine art of organizing Meetups
Analytics Engineer
Xebia

15.40 - 16.00

15.40 - 16.00

BREAK

16.00 - 16.25

16.00 - 16.50
Plenary session
ProjectPoint Panel
16.00 - 16.25 | 25min
Advanced in BigData Projects: what is hot, what is trendy, what is needed.

This panel will bring together practitioners from chosen enterprises that are utilizing big data projects. They will share experiences and lessons learned from implementation of big data interesting use cases - to show where we are headed in usage of big data technologies, tools and fulfillment of business needs. We will ask series of practical questions to all panelist that might inspire you in context of your organization.

Moderator:
Chief Growth Officer
GetInData | Part of Xebia
Panel participants:
Director
Xebia Data
Head of Data and Innovation
ING Hubs Poland
Platform Architect
Roche Informatics
IT&D Director, Platform & Product Stream Architect, Global Data & Analytics
Reckitt

16.25 - 16.50

Plenary session
16.25 - 16.50 | 25min
Optimizing signup user journeys with big data at Netflix.

How to build a good Netflix signup experience? In a consumer-facing product, user journeys - like account creation - have a direct impact on the business metrics. Come and learn how Netflix uses (big) data to continuously improve their user journeys. 

#DataAnalysis #UserJourneys #A/Bexperimentation

Staff Software Engineer
Netflix

16.50 - 17.45

16.50 - 17.45

ROUNDTABLES (ONSITE only)

Parallel roundtables discussions are the part of the conference that engage all participants. It has few purposes. First of all, participants have the opportunity to exchange their opinions and experiences about specific issue that is important to that group. Secondly, participants can meet and talk with the leader/host of the roundtable discussion – they are selected professionals with a vast knowledge and experience.

There will be one roundtable sessions, hence every conference participants can take part in 1 discussion.

 

 

Roundtable discussion
1. Does serverless data transformation and analytics pay off?

Serverless data transformation approach seems to be good choice for many if not for all use cases? Do you agree? Probably the answer is not so straight forward. Thus I would like to invite you to the discussion about that. We will share our real project experience, talking about chances and risks. We will analyse several perspectives including: available technology stacks, solution delivery time, overall cost, scale of solution, competences availability, vendor lock. See you soon!

Moderator:
Data Analytics Practice Manager
Onwelo
Roundtable discussion
2. Data Lake House & AI to empower business decisions

DataLakeHouse is a concept everybody knows. However how, together with AI it can support busniess decisions. Should DataLakeHouse still feed AI so it can make it own correct decisions in finance world. Let's discuss together, about core concepts, is datalake house really needed, and if AI at some point of time, can make proper investment decisions.

Moderator:
Head of Data Platform Engineering
Point72
Roundtable discussion
3. Digital Experimentation done right!

Digital experimentation: the ultimate weapon for businesses seeking a competitive edge in the world of tech. By using real-time data and cutting-edge tech, it optimizes digital offerings, enhances user experiences, and drives growth. Unlock the power of digital experimentation to take your digital game to the next level.

Moderator:
Senior Data Scientist
Kuehne+Nagel
Roundtable discussion
4. Business applications of generative AI

What we can observe today is the rapid growth of new AI models that achieve astonishing results in generating text, audio, images, and even source code. It becomes clear that the generative AI revolution has begun. The question is: are companies ready for it? Join me for a discussion – everyone will have a chance to share their thoughts, discuss experiences, exchange ideas, ask and answer questions.
Discussion points include e.g.:
● Which AI models have the potential for being applied by companies?
● How can they be used to gain profit?
● What are the risks of using them?
● How can we mitigate these risks?

Moderator:
Director, AI R&D
Pearson
Roundtable discussion
5. Perfect data streaming platform

What are the key characteristics of a self-service data streaming platform? Is ANSI SQL good enough? If not, what's missing? What about data discoverability? How data should be shared inside of an organization and how it should be secured? What else does your organization care deeply about?

Moderator:
Staff Software Engineer
Confluent
Roundtable discussion
6. Data Mesh in Practice

Data Mesh has been one of the most hyped buzzwords in the data space for the past 2-3 years. Everybody talks about it, but only a few have fully grasped the concepts behind it. Let's get together at this roundtable, to discuss our impressions and understandings of Data Mesh, what is at the core of its concepts, and what experiences we can share and learn from each other.

Moderator:
Associate Director of Data Engineering
HelloFresh
Roundtable discussion
7. Data Quality, Data Observability and DataOps

Description: There’s a lot of fuss around Data Quality, Data Observability and DataOps, but what they are all about?
How do they fit in a data platform or a data practice? What are the most relevant tools in the space?

Join this round table to discuss your experience and learn other people experience regarding Data Ops, Data Observability and Data Quality.

Moderator:
Data Architect
Agile Lab
Roundtable discussion
8. Why migrations in Data Platform are hard and (usually) take longer than expected

From on-prem Hadoop to GCS, Dataflow, and Bigquery, from TSV to protobuf, from EMR to Databricks, from EC2 to k8s, from Redshift to Delta and Presto. Sounds familiar? Let's talk about migrations in Data Platforms, how to do that, and why it's hard, costly, and usually takes longer than expected.
- Why does it take longer than initially estimated?
- Why do users often not want to / can't migrate now?
- Do you always need to run two data infrastructures (the old and the new one) in parallel? Will it cost twice as much as expected?
- Should we offload migration to the users or migrate on their behalf by task force teams?
- Why do we need to migrate again, haven't we finished the last migration like 3 months ago?

Moderator:
Engineering Manager, Data Platform
Bolt
Roundtable discussion
9. Responsible AI governance

At a time when various industries are increasingly embracing AI, the excitement around the opportunity may overshadow the wide-ranging ethical & legal implications. For example, an ed-tech startup may fast-track its way to operationalizing a language tutoring chatbot by using pre-trained language models containing built-in bias, which may lead to reputational damage if not managed properly. Similarly, a healthcare company may choose to implement a highly accurate black box model without realizing that the customers (or regulators) may require a solution that is transparent and interpretable––even if it is less accurate. In this session, we’ll share our approaches to these and other considerations around developing responsible AI.

Moderator:
Vice President of AI Learning Capabilities
Pearson
Roundtable discussion
10. MLOps frameworks and workflow orchestrators - bring your own experience!

MLOps landscape is rapidly evolving with new tools being released almost on a daily basis. How not to get lost in such a deluge of solutions? What are the main selection criteria and core features that such frameworks need to have? Is the Kubernetes platform always the preferred runtime environment? When would you go for fully managed services like Vertex AI pipelines or Azure ML pipelines? What approach would you choose in the case of a hybrid setup? To abstract, or not to abstract - does it make sense to have a unified pipeline API? What are your MLOps plans for 2023? Let’s brainstorm together!

Moderator:
Chief Data Architect
GetInData | Part of Xebia

17.45 - 18.00

17.45 - 18.00

SUMMARY & PRIZE GIVEAWAY

19.00 - 22.00

19.00 - 22.00
EVENING NETWORKING SESSION
ONSITE
Level27 club

Evening Meeting for all (*advance registration for the event is required)

Let's get together! To talk, to meet new people, to see old colleagues. We invite you for a face 2 face interaction onsite. The integration meeting will take place at the Level27 club in the center of Warsaw. The event starts at 19:00.
More information HERE.

30.03.2023 - 2TH CONFERENCE DAY | ONLINE only

 

9.30 - 12.00

9.30 - 12.00

PARALLEL TECHNICAL WORKSHOPS (all participants could join)

ONLINE

DESCRIPTION:

SESSION LEADERS:

Senior Data Product Manager
Reckitt

Data Solutions Architect
Reckitt

DESCRIPTION:

SESSION LEADER:

Lead BigData DW/BI Engineer
SoftServe

DESCRIPTION:

SESSION LEADER:

Data & AI Cloud Solution Architect
Microsoft

DESCRIPTION:

SESSION LEADERS:

Senior Data Product Manager
Reckitt

Data Solutions Architect
Reckitt

DESCRIPTION:

SESSION LEADER:

Lead BigData DW/BI Engineer
SoftServe

DESCRIPTION:

SESSION LEADER:

Data & AI Cloud Solution Architect
Microsoft

12.00 - 13.00

12.00 - 13.00

BREAK

13.00 - 13.10

13.00 - 13.35
Plenary session
13.00 - 13.10 | 10min
Opening
CEO & Meeting Designer
Evention
CEO and Co-founder
GetInData | Part of Xebia

13.10 - 13.35

Plenary session
13.10 - 13.35 | 25min
Hyperscaling growth made simple - Revolut's services and data architecture
Engineering Executive & Head of Engineering for Revolut Business
Revolut

13.40 - 15.20

13.40 - 15.20
PARALLEL SESSIONS

13.40 - 14.10

Parallel session 1
Data Engineering
16.40 - 17.10 | 30min
DataOps in action with Nessie, Iceberg and Great Expectations
Data Architect
Agile Lab
Parallel session 2
AI, ML and Data Science
13.40 - 14.10 | 30min
What Modern Data Analytics Platform can do for your Data Pipeline?
Data Scientist Lead INTL
Vertica by OpenText
Parallel session 3
Architecture, Operations and Cloud, Data Engineering
13.40 - 14.10 | 30min
Data Platform - a modern one. A new stack that promotes self-service with well-known best DataOps practices
Data Architect / Technical Product Owner
GetInData | Part of Xebia

14.15 - 14.45

Parallel sesion 1
Data Engineering
14.15 - 14.45 | 30min
Column-level lineage is coming to the rescue
Data Engineer
GetInData | Part of Xebia
Data Engineer
GetInData | Part of Xebia
Parallel session 2
MLOps,AI, ML and Data Science
14.15 - 14.45 | 30min
Influence of NLP in External social media data
Senior IT professional
Roche Informatics
Parallel session 3
Data Engineering, Streaming and Real-Time Analytics
14.15 - 14.45 | 30min
Challenges for streaming delivery of ordered data on AWS
Senior Data Engineer
Free2Move

14.50 - 15.20

Parallel session 1
MLOps
14.50 - 15.20 | 30min
Is the MLOps tooling the same as the DevOps tooling?
Solutions Architect
Chaos Gears
Parallel session 2
AI, ML and Data Science
14.50 - 15.20 | 30min
Degenerative feedback loops - how they emerge and why should we care?
Data Science Tech Lead
King (Candy Crush Saga)
Parallel session 3
Data Strategy and ROI
14.50 - 15.20 | 30min
How to make money with free data
Founder and Chief Data Wizard
Token Flow Insights SA

15.20 - 15.30

15.20 - 15.30

BREAK

15.30 - 17.10

15.30 - 17.10
PARALLEL SESSIONS

15.30 - 16.00

Parallel session 1
Data Engineering
15:30 - 16:00 | 30min
Modern Data Pipelines in AdTech—Life in the Trenches
Big Data Developer
Captify
Parallel session 2
AI, ML and Data Science
15.30 - 16.00 | 30min
AI for Anomaly Detection and Root Cause Analysis in network area
Data Scientist
Orange Polska
Parallel session 3
Streaming and Real-Time Analytics
15.30 - 16.00 | 30min
Where is my bottleneck? Performance troubleshooting in Flink
Staff Software Engineer
Confluent

16.05 - 16.35

Parallel session 1
MLOPs
16.05 - 16.35 | 30min
Speedup your MLOps with Rust
Solutions Architect
Bank Millennium
Parallel session 2
Streaming and Real-Time Analytics
16.05 - 16.35 | 30min
If it isn't hot, it doesn't deliver: Apache Pinot, Food Delivery, and why real-time analytics matter
Developer Advocate
StarTree
Parallel session 3
AI, ML and Data Science
16.05 - 16.35 | 30min
Do my ads still work? Using science and machine learning to build a modern marketing measurement stack.
Data Science Lead
Twigeo

16.40 - 17.10

Parallel session 1
Data Engineering
16.40 - 17.10 | 30min
An open standard for data lineage
Head of community
Astronomer
Parallel session 2
AI, ML and Data Science
16.40 - 17.10 | 30min
Chronon - an open source feature engineering framework
Staff Software Engineer
Airbnb
Parallel session 3
Streaming and Real-Time Analytics
16.40 - 17.10 | 30min
Monitor Fleet in Real-Time Using Stream SQL
CTO and co-founder
Timeplus

17.15 - 17.30

17.15 - 17.30
Plenary session
Summary & closing
CEO & Meeting Designer
Evention
CEO and Co-founder
GetInData | Part of Xebia

ONLINE EXPO + KNOWLEDGE ZONE

Free participation

We have great set of presentation available in the CONTENT ZONE that would be available pre-recorded as Video on Demand for conference participants in advance

Data Engineering – how do we do it at Allegro?
Team Leader
Allegro
Apache Iceberg: An Architectural Look Under the Covers
Developer Advocate
Dremio
Simple is better than complex – lessons learnt from building data platforms
Senior Data Engineer
GetInData | Part of Xebia
How can big data & AI models support the ways teams work: workload & well-being
CEO
Network Perspective
Improving therapeutic progress notes taking using A.I. based topic modeling
Senior Researcher and Data Scientist
Eleos Health
Senior Data Scientist
Eleos Health
Leveraging high-impact data science projects by creating reliable model feature warehouses
Data Architect
Next Reason

BIG DATA TECHNOLOGY
WARSAW SUMMIT

ORGANIZER

Evention sp. z o.o

Rondo ONZ 1 Str,

Warsaw, Poland

www.evention.pl

CONTACT

Weronika Warpas

© 2024 | This site uses cookies.