Contact Tracing with Elasticsearch

1h 37m 3s
English
Paid

In this fascinating engineering project, you will learn to track user movements through their phone scans. The aim of the project is to use Elasticsearch as a search system to analyze a dataset in which 100,000 users visit stores and make 1,000,000 scans.

Read more about the course

You will create your own dataset using Python and Pandas, utilizing an open dataset of San Francisco stores containing over 140,000 stores with their names and coordinates. From this dataset, you will select 10,000 stores and generate 100,000 fictional users, each of whom will perform an average of 10 check-ins. After uploading the data to Elasticsearch, you will create a user interface with Streamlit for data visualization.

Your application interface includes:

  • Search by store name
  • Search by ZIP code to filter stores by area
  • Search by business ID for visit analysis
  • Search and track by Device ID to see where a specific user has been

In the course of working on the project, you will learn to:

  • Transform data and upload it in parquet format to Elasticsearch
  • Work with Kibana for index management and document search
  • Create an interactive interface with Streamlit featuring controls, Folium maps, and tables
  • Configure pages and execute queries to Elasticsearch

Course Program

  • Preparing the San Francisco dataset with 10,000 stores
  • Generating 100,000 fictional users
  • Merging user data with stores
  • Creating 1,000,000 app check-ins
  • Preparing data for upload to Elasticsearch
  • Uploading data to Elasticsearch
  • Developing a Streamlit application: maps, filters, tables
  • Page setup and working with Elasticsearch queries

Requirements

Before starting, it is recommended to take the course “Log Analysis in Elasticsearch” to understand the basics of working with Elasticsearch. Additionally, due to extensive work with data, it's advisable to complete the lessons on Pandas from the course “Python for Data Engineers”.

The project is designed for a computer with 8 GB of RAM.

Watch Online Contact Tracing with Elasticsearch

Join premium to watch
Go to premium
# Title Duration
1 Introduction 03:01
2 Setup & Goals 03:28
3 San Francisco dataset 03:49
4 Relational database vs elasticsearch 06:49
5 Preparing the dev environment 02:05
6 Prepare the SF dataset 1 09:48
7 Preparing the SF dataset 2 08:47
8 Creating 100k fake users 08:59
9 Merging 100k users with SF dataset 06:02
10 Creating app scans for users 08:22
11 Preparing Elasticsearch and loading the data 04:41
12 Creating the Streamlit app basics and folium maps 02:27
13 Page setup and querying from Elasticsearch 05:28
14 Creating free text search 04:58
15 Zip code search 02:24
16 Business_id search 04:03
17 Search by device ID & tracking people 03:38
18 Summary 03:53
19 Outlook 04:21

Similar courses to Contact Tracing with Elasticsearch

LeetCode In Python: 50 Algorithms Coding Interview Questions

LeetCode In Python: 50 Algorithms Coding Interview Questionsudemy

Category: Python
Duration 19 hours 36 minutes 13 seconds
Object-Oriented Programming

Object-Oriented Programmingprogrammingexpert.io

Category: Others, Python
Duration 4 hours 36 minutes 7 seconds
Spark and Python for Big Data with PySpark

Spark and Python for Big Data with PySparkudemy

Category: Python, Data processing and analysis
Duration 10 hours 35 minutes 43 seconds
Machine Learning A-Z : Become Kaggle Master

Machine Learning A-Z : Become Kaggle Masterudemy

Category: Python, Data processing and analysis
Duration 36 hours 23 minutes 54 seconds
Visual Studio Code for Python Developers

Visual Studio Code for Python DevelopersTalkpython

Category: Python, Visual Studio Code
Duration 4 hours 10 minutes 20 seconds
Build a Python REST API with the Django Rest Framework

Build a Python REST API with the Django Rest Frameworkudemy

Category: Python, Django
Duration 10 hours 8 minutes 56 seconds
Build an LLM-powered Q&A App using LangChain, OpenAI and Python

Build an LLM-powered Q&A App using LangChain, OpenAI and Pythonzerotomastery.io

Category: Python
Duration 2 hours 38 minutes 22 seconds
Complete linear algebra: theory and implementation

Complete linear algebra: theory and implementationudemy

Category: Python, Data processing and analysis
Duration 32 hours 53 minutes 26 seconds
Python on the Backend

Python on the Backendudemy

Category: Python
Duration 3 hours 51 minutes 33 seconds
Python for Data Engineers

Python for Data EngineersAndreas Kretz

Category: Python, Data processing and analysis
Duration 2 hours 21 minutes 18 seconds