Skip to main content

Contact Tracing with Elasticsearch

1h 37m 3s
English
Paid

Course description

In this fascinating engineering project, you will learn to track user movements through their phone scans. The aim of the project is to use Elasticsearch as a search system to analyze a dataset in which 100,000 users visit stores and make 1,000,000 scans.

Read more about the course

You will create your own dataset using Python and Pandas, utilizing an open dataset of San Francisco stores containing over 140,000 stores with their names and coordinates. From this dataset, you will select 10,000 stores and generate 100,000 fictional users, each of whom will perform an average of 10 check-ins. After uploading the data to Elasticsearch, you will create a user interface with Streamlit for data visualization.

Your application interface includes:

  • Search by store name
  • Search by ZIP code to filter stores by area
  • Search by business ID for visit analysis
  • Search and track by Device ID to see where a specific user has been

In the course of working on the project, you will learn to:

  • Transform data and upload it in parquet format to Elasticsearch
  • Work with Kibana for index management and document search
  • Create an interactive interface with Streamlit featuring controls, Folium maps, and tables
  • Configure pages and execute queries to Elasticsearch

Course Program

  • Preparing the San Francisco dataset with 10,000 stores
  • Generating 100,000 fictional users
  • Merging user data with stores
  • Creating 1,000,000 app check-ins
  • Preparing data for upload to Elasticsearch
  • Uploading data to Elasticsearch
  • Developing a Streamlit application: maps, filters, tables
  • Page setup and working with Elasticsearch queries

Requirements

Before starting, it is recommended to take the course “Log Analysis in Elasticsearch” to understand the basics of working with Elasticsearch. Additionally, due to extensive work with data, it's advisable to complete the lessons on Pandas from the course “Python for Data Engineers”.

The project is designed for a computer with 8 GB of RAM.

Watch Online

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 19 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Introduction

All Course Lessons (19)

#Lesson TitleDurationAccess
1
Introduction Demo
03:01
2
Setup & Goals
03:28
3
San Francisco dataset
03:49
4
Relational database vs elasticsearch
06:49
5
Preparing the dev environment
02:05
6
Prepare the SF dataset 1
09:48
7
Preparing the SF dataset 2
08:47
8
Creating 100k fake users
08:59
9
Merging 100k users with SF dataset
06:02
10
Creating app scans for users
08:22
11
Preparing Elasticsearch and loading the data
04:41
12
Creating the Streamlit app basics and folium maps
02:27
13
Page setup and querying from Elasticsearch
05:28
14
Creating free text search
04:58
15
Zip code search
02:24
16
Business_id search
04:03
17
Search by device ID & tracking people
03:38
18
Summary
03:53
19
Outlook
04:21

Unlock unlimited learning

Get instant access to all 18 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Comments

0 comments

Want to join the conversation?

Sign in to comment

Similar courses

Python for Business Data Analytics & Intelligence

Python for Business Data Analytics & Intelligence

Sources: zerotomastery.io
Become a top Business Data Analyst. We’ll teach you everything you need to go from a complete beginner to getting hired as an analytics professional. You’ll lea
15 hours 25 minutes 6 seconds
Rock Solid Python with Python Typing Course

Rock Solid Python with Python Typing Course

Sources: Talkpython
When Python was originally invented way back in 1989, it was a truly dynamic and typeless programming language. But that all changed in Python 3.5 when type "hints" were added t...
4 hours 27 minutes 54 seconds
Mathematical Foundations of Machine Learning

Mathematical Foundations of Machine Learning

Sources: udemy
Mathematics forms the core of data science and machine learning. Thus, to be the best data scientist you can be, you must have a working understanding of the mo
16 hours 25 minutes 26 seconds
Automated Software Testing with Python

Automated Software Testing with Python

Sources: udemy
Testing automation doesn't have to be painful. Software testing is an essential skill for any developer, and I'm here to help you truly understand all types of
13 hours 26 minutes 55 seconds
Introduction to Python

Introduction to Python

Sources: Amit Jain
In Data Engineering, programming plays a key role. Whether you are working with interfaces, databases, or engaged in transformation...
1 hour 18 minutes 14 seconds