CV | Omid Sadeghnezhad

General Information

Full Name	Omid Sadeghnezhad
Date of Birth	2nd February 1997
Languages	English (reading, writing, and speaking proficiency), Persian (native)

Research Interests

Computer Vision and Video Processing
High-Performance Programming, Parallel Computing, and Multi-Processing
Generative AI, Multimodal generative models
Prompt engineering and management, Chain and agent architectures, Large Language Model (LLM) integration, Vector databases and semantic search, Retrieval-augmented generation (RAG)

Research Experience

Retrieval-Augmented Generation - 2024
Evolutionary algorithms and implementation of the Symbiotic Organisms Search (SOS) algorithm - 2023
Survey of GAN developments and latent-space analysis (basic GAN, DCGAN, ACGAN, WGAN, BigGAN, PGGAN, StyleGAN, StyleGAN2, StarGAN, SEAN) - 2022
Theories of Multimodal deep learning - 2021
Survey of virtual clothing try-on models (ACGPN, SwapNet, CP-VTON-PLUS) - 2020
Image harmonization and blending methods (DoveNet, DeepImageBlending), brightness transfer (global and local), color transfer methods (mean-std transfer, Lab mean transfer, and PDF transform), and color constancy with image-to-image translation (pix2pix, CycleGAN, and contrastive-unpaired-translation) - 2020
Image depth estimation (Monodepth, PyDNet) and salient object detection models (BASNet, U2-Net, PoolNet, CPD) - 2020

Education

2020-2023
Master

K. N. Toosi University of Technology
- Artificial Intelligence
- Seminar
  - Generative Adversarial Networks (GAN) Survey
2015-2020
Bachelor

Shahrood University of Technology
- Electrical and Electronics Engineering

Experience

2021-present
Machine Learning Engineer

Parstech.Co
- Reduced the RAM and CPU usage of the camera streaming service by about 30%.
- Implemented a streaming service that connects to 25 HD cameras and processes 20 frames per second on a 12-core CPU with 2.5 GB of RAM.
- Integrated a personal protective equipment detection system that handles 5 equipment categories across 4 cameras.
- Developed a task-oriented framework for managing ML models and pipelines in production environments.
- Dockerized 3 AI services and 5 backend services to run reliably and securely on any infrastructure.
- Trained 5 co-workers to lead different product services.
2020-2021
Machine Learning Engineer

TaraTech.Co
- Designed a pipeline of 5 ML models (object detection, layout composition, pose detection, depth estimation, and image blending) for human localization in a background image.
- Trained an ACGAN model on a heart disease dataset for data augmentation, achieving 70% accuracy in generating synthetic data.
2019-2020
Internship

Telecommunication Company of Iran

Skills

Coding
	Python, SQL, C, C++, LaTeX
Databases
	MySQL, SQLite, Redis, Elasticsearch
ML Frameworks
	PyTorch, scikit-learn, Ray, LangChain
DevOps & Tools
	Docker, Git, FastAPI, Celery
Computer Vision
	GStreamer, OpenCV
Misc.
	Multi-Processing and Multi-Threading Programming, Shared Memory Management, API Implementation, Task Queuing and Management, Image and Video Processing, Deep Neural Network Implementation, Containerization, Version Control

Projects

2022-present
Parstech Video Intelligent Assistant

Intelligent Security Assistant
- AI assistant that processes and analyzes videos with modules such as human detection, face recognition, and license plate recognition. I worked in depth on human detection and the restricted-area application, camera handling and streaming, DevOps, and the application's microservices.
- Features
  - Human detection and tracking
  - Video Processing with OpenCV
  - FastAPI implementation for APIs
  - GStreamer and OpenCV cores for the camera management service
  - Task queueing and workflow management with Celery
  - Containerized services
  - In-memory data transfer with Redis and Python shared memory
  - Multi-processing and multi-threading
2022
Lip-Reading Model on Persian Dataset
- Trained a lip-reading model on a Persian dataset.
2021
Personal Protective Equipment (PPE)

Implementation of a system to identify personal safety equipment
- A program that identifies workers’ safety equipment in workshop and construction environments, such as gloves, helmets, glasses, masks, and safety vests. It also provides warnings for people who enter prohibited areas, fire detection, and identification of work tools.
- Features
  - Human detection and tracking
  - Pose estimation to check human body status
  - Body part localizer for head, hands, and chest
  - Equipment classifier models
  - Pipeline for detecting human falls
  - Function acceleration with the Numba JIT compiler
2021
Pressure Ulcer
- Analysis of pressure sensor data of hospital beds for prediction and diagnosis of bed sores.
- Features
  - Body segmentation models
  - Pose estimation model for pressure sensor data
  - Video processing with OpenCV
  - Signal Capturing from human poses
2021
Cardiac Medical Data Augmentation
- Trained an ACGAN on real cardiac medical data to generate synthetic data for augmentation.
2021
Human localization
- Given a foreground image and a background image, the pipeline extracts the person from the foreground image, finds a suitable position in the background image, masks the person and places them there, and then corrects color and lighting and refines the composite with a model.
- Features
  - Depth Estimation
  - Image brightness and color correction models
  - Object detection models to find humans and other objects in the image
  - Image perspective calculation
  - Image blending
2021
OCR
- Optical Character Recognition on German administrative forms.
- Features
  - Tesseract and a self-trained text recognition neural network
  - Clustering algorithms to find text lines
  - OpenCV and image processing methods to find information blocks

Interests

Guitar
Video Games
Ping Pong
Nature
Psychology & Philosophy
