TLDR Maternal Health

Project Overview

TLDR Maternal Health is a comprehensive project that harnesses the power of a distributed system and natural language processing (NLP) techniques to cater to the informational needs of pregnant women. NLP techniques are employed to analyze and process textual information related to pregnancy, childbirth, and maternal health. This involves tasks such as text summarization, sentiment analysis, and information extraction from medical literature and online resources. The ultimate goal of the project is to develop an interactive user interface that offers essential guidance and support throughout the stages of pregnancy.

Design

This section shows a visual design of the primary components of TLDR Maternal Health, which are Azure Virtual Machines, a PostgreSQL Database, and Python Transformers.

Azure Virtual Machines

PostgreSQL Database

Docker

Python Transformers

Originally the proposal suggested using SpaCy for text summarization, but this NLP library didn't have the best precision. The final chosen text summarization technique is a Bart Transformer model that processes slightly slower but generates incredible summaries that take key insights.

Implementation Instructions

This section provides step-by-step instructions for setting up the project environment.

Azure Virtual Machines Setup

Instructions:

Here are the steps summarized for your implementation instructions section:

Create Virtual Machine 1 or Server VM

Size
- Size: Standard B2s
- vCPUs: 2
- RAM: 4 GiB

Create Virtual Machine 2 or Client VM

Size
- Size: Standard B1ms
- vCPUs: 1
- RAM: 2 GiB

Create a Connection Monitor

Protocol: TCP
Destination Port: 22

Set up Azure Network Watcher:
- Enable Network Watcher for your Azure subscription and select the appropriate region.
Monitoring VM-to-VM Communication:
- Access the Network Watcher dashboard and select the "VM1 to VM2" option.

Python Connection

# Server VM

# client socket
    c = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server_address = (('10.0.0.5', 8888))
    
    # connect to the server
    c.connect(server_address)
    
    # request tldr summaries
    if type == 'tldr':
        Model.begin() 
        with open('tldr.json', 'r') as json_file:
            data = json.load(json_file)
    else:
        print('Invalid Request Type')    
        return

    # send data to client and close the connection
    c.sendall(data.encode())
    c.close()

# Client VM

# server socket
    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    s.bind(('10.0.0.5',8888))
    s.listen(1)
    
    print('Server is listening...')
    
    # accept incoming connections from client
    c_socket, c_address = s.accept()
    print(f'Connection from {c_address}')
    
    # receive data from the client
    data = c_socket.recv(1024).decode()
    print(f'Received from client: {data}')

PostgreSQL Database Setup

# Update the package index
sudo apt update

# Install PostgreSQL and its dependencies
sudo apt install postgresql postgresql-contrib

# Check the status of PostgreSQL service
sudo systemctl status postgresql

# By default, PostgreSQL creates a Linux user named "postgres"
# Switch to the "postgres" user
sudo su - postgres

# Access the PostgreSQL command-line interface (psql)
psql

# Create a new database user and set a password
CREATE USER azureuser WITH PASSWORD '...';

# Create a new database and assign ownership to the newly created user
CREATE DATABASE database OWNER azureuser;

# Grant all privileges on the database to the user
GRANT ALL PRIVILEGES ON DATABASE database TO azureuser;

# Exit the PostgreSQL command-line interface
\q

# Exit from the "postgres" user session
exit

Docker Setup

# Update the package index
sudo apt update

# Install Docker prerequisites
sudo apt install apt-transport-https ca-certificates curl software-properties-common

# Add the Docker GPG key
curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -

# Add the Docker repository
sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable"

# Update the package index again
sudo apt update

# Install Docker Engine
sudo apt install docker-ce

# Verify Docker installation by checking the version
docker --version

exit

Usage

This section explains how to use the project after it has been set up.

Virtual Machine 1 or Server VM

ssh -v -i file1.pem azureuser@address
cd ServerFolder
docker build -t server-image . 
docker run server-image python server-side.py

Virtual Machine 2 or Client VM

ssh -v -i file2.pem azureuser@address
cd ClientFolder
docker build -t client-image . 
docker run client-image python client-side.py

Project Structure

This section provides an overview of the project directory structure.

project/
│
├── ServerFolder/
│   ├── Dockerfile
│   ├── Model.py
│   ├── server-side.py
│   ├── tldr.json
│   └── requirements.txt
│
├── ClientFolder/
│   ├── Dockerfile
│   ├── entrypoint.sh
│   ├── input-tldr-data.sql
│   ├── requirements.txt
│   ├── client-side.py
│   ├── server.py
│   └── output_data.csv
│
├── tests/
│   ├── rouge.ipynb
│   └── test_stat.ipynb
│
└── README.md

Key Measurement Results

This section outlines how performance improved throughout the development of this project.

Spacy to BART

Virtual Machine Resources

This project consisted of two data-heavy operations: a Transformer model that requires a large library installation and the database contents. Placing these tasks in separate virtual machines allowed for purchasing two inexpensive virtual machines that execute their task in parallel.

Demo

Original Article: https://www.nhs.uk/pregnancy/week-by-week/1-to-12/1-2-3-weeks/

What did I learn?

How to create Azure Virtual Machines
- Connection Monitor
- Network Watcher
- Defining allocated resources
- Upkeeping resources
Cleaning up the machine for additional storage -> Estimating the amount of storage required
Creating a Dockerfile for the requirements of multiple machines
Interacting between Client & Server Virtual Machines
Using Transformers for summarization
Using Python for Client-Server interaction

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
ClientFolder		ClientFolder
ServerFolder		ServerFolder
Test		Test
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TLDR Maternal Health

Table of Contents

Project Overview

Design

Azure Virtual Machines

PostgreSQL Database

Docker

Python Transformers

Implementation Instructions

Azure Virtual Machines Setup

Python Connection

PostgreSQL Database Setup

Docker Setup

Usage

Virtual Machine 1 or Server VM

Virtual Machine 2 or Client VM

Project Structure

Key Measurement Results

Spacy to BART

Virtual Machine Resources

Demo

What did I learn?

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TLDR Maternal Health

Table of Contents

Project Overview

Design

Azure Virtual Machines

PostgreSQL Database

Docker

Python Transformers

Implementation Instructions

Azure Virtual Machines Setup

Python Connection

PostgreSQL Database Setup

Docker Setup

Usage

Virtual Machine 1 or Server VM

Virtual Machine 2 or Client VM

Project Structure

Key Measurement Results

Spacy to BART

Virtual Machine Resources

Demo

What did I learn?

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages