CV

This is My CV

General Information

Full Name Omid Sadeghnezhad
Date of Birth 2nd February 1997
Languages Reading, writing and speaking competencies for English, Native Persian.

Research Interests

  • Computer Vision and Video Processing
  • High-Performance programming, Parallel Computing, and Multi-Processing
  • Generative AI, Multimodal generative models
  • Prompt engineering and management, Chain and agent architectures, Large Language Model (LLM) integration, Vector databases and semantic search, Retrieval-augmented generation (RAG)

Research Experience

  • PhotoGrammetry & Voice to Text Generation - 2025
  • Visual & Texual Retrieval-Augmented Generation - 2024
  • Evolutionary Algorithms and implementing Symbiotic Organisms Search algorithm (SOS) - 2023
  • GAN Developments survey and Analyze the Latent space. (Basic GAN, DCGAN ACGAN, WGAN, BigGAN, PGGAN, STYLEGAN, STYLEGAN2, StarGAN, SEAN) - 2022
  • Theories of Multimodal deep learning - 2021
  • Clothes virtual try-on models survey. (ACGPN, SwapNet, CP-VTON-PLUS) - 2020
  • Image harmonization and blending methods (Dovenet, DeepImageBlending), Brightness transfer (global and local transfer), color transfer methods (mean-std transfer, Lab mean transfer, and pdf transform), and color constancy with image to image translation (pix2pix, CycleGAN, and contrastive-unpaired-translation) - 2020
  • Image depth extractions (monoDepth, Pydnet) and salient object detection models (Basnet, U2Net, PoolNet, CPD) - 2020

Education

  • 2020-2023
    Master
    K. N. Toosi University of Technology
    • Artificial Intelligence.
    • Seminar
      • Generative Adversarial Networks (GAN) Survey
  • 2015-2020
    Bachelor
    Shahrood University of Technology
    • Electrical and Electronics Engineering

Experience

  • 2023-2025
    Team Lead & Python Developer
    Parstech.Co
    • Prototyped 4 services that were responsible for photogrammetric processing on drone images.
    • Architectured a photo aesthetic assessment service which was capable of visual RAG to analyse over 60 parameters with gathered specialized dataset.
    • Introduced and programmed a graph-based workflow to create a user-defined dialogue flow for a personalized chatbot that can be customized with LLMs and RAGs.
    • Leveraged asynchronous I/O to enhance server capacity and reduce response times.
    • Improved multi-processing, multi-threading, and asynchronous programming skills of 12 of my coworkers.
  • 2021-2023
    Machine Learning Engineer
    Parstech.Co
    • Optimized RAM and CPU of the camera streaming service by about 30%.
    • Implemented a streaming service that can connect to 25 HD cameras with 12 cores CPU and 2.5 GB RAM to proceed 20 frames per second.
    • Integrated a system for detecting personal protective equipment with 4 cameras capable of 5 categories.
    • Developed a task-oriented framework to deal with ML models and pipelines in the production environments.
    • Dockerized 3 AI and 5 Backend services to run reliably and securely in any infrastructure.
    • Trained 5 of my co-workers to lead the different product services.
  • 2020-2021
    Machine Learning Engineer
    TaraTech.Co
    • Designed a pipeline containing 5 ML models (object detection, layout composition, pose detection, depth estimation, and image blending) for human localization on the background image.
    • Trained an ACGAN model on the heart disease dataset to augment the data which achieved 70% accuracy in generating fake data.
  • 2019-2020
    Internship
    Telecommunication Company of Iran

Skills

Coding
Python, sql, C, C++, LATEX
Databases
MYSQL, SQlite, Redis, ElasticSearch
ML Frameworks
Pytorch, Scikit-Learn, Ray, LangChain
DevOps & Tools
Docker, Git, FastAPI, Celery
Computer Vision
GStreamer, OpenCV
Misc.
Multi Processing and Threading Programming , Managing Shared Memory , Implementing API , Task Queuing and Management , Image and Video Processing , Deep Neural Networks Implementation , Containerizing , Version Controling

Projects

  • 2025-2026
    PhotoGear
    • Orthomosaic photo generation from drone images with a pipeline of radiometric calibration, Structure from Motion (SfM), and Orthomosaic Generation.
    • Features
      • NDVI calculation and vegetation analysis
      • Structure from Motion (SfM) for 3D reconstruction with COLMAP and OpenSFM
      • Orthomosaic Generation with GDAL and PDAL
      • API Gateway and Queue management with FastAPI for data processing and task management
  • 2024-2025
    Photo Aesthetic Assessment
    • A platform to assess photo aesthetic parameters with LLM combined with VRAG and some computer vision pipelines.
    • Features
      • Visual-RAG to retrieve photo and the context with Langchain
      • MongoDB to search on embedding vectors
      • Async FastAPI endpoints
      • Clean architecture Design
      • Dependency injection and wiring with Lagom
  • 2022-2025
    Parstech Video Intelligent Assistant
    • AI assistance to process and analyze videos with modules like human detection, face recognition, and license plate recognition. I deeply worked on human detection and restriction area application, Camera handling and streaming, DevOps, and microservices of this application.
    • Features
      • Human detection and tracking
      • Video Processing with OpenCV
      • FastAPI implementation for APIs
      • Gstreamer and OpenCV cores for the Camera management service
      • Task queueing and workflow management with Celery
      • Containerized services
      • In-Memory data transfer with Redis and python shared memory
      • Multi-Processing and Multi-Threading features implemented
  • 2022
    LipReading model on Persian Dataset
    • Train a LipReading model on the Persian dataset.
  • 2021
    Personal Protective Equipment (PPE)
    • A program to identify workers’ safety equipment in workshop and construction environments, such as gloves, helmets, glasses, masks, safety vests, warning capabilities for people who enter prohibited areas, fire detection, and identification of work tools.
    • Features
      • Human detection and tracking
      • Pose estimation to check human body status
      • Body part localizer for head, hands, and chest
      • Equipment classifiers models
      • Pipeline implementation to estimate human fall
      • Faster functions with numba jit compiler
  • 2021
    Pressure Ulcer
    • Analysis of pressure sensor data of hospital beds for prediction and diagnosis of bed sores.
    • Features
      • Body Segmentation Models.
      • Pose estimation model for pressure sensor data
      • Video processing with OpenCV
      • Signal Capturing from human poses
  • 2021
    Cardiac Medical Data Augmentation
    • Train ACGAN on real Cardiac Medical Data in order to generate fake data for augmentation.
  • 2021
    Human localization
    • In two categories of foreground and background images, it takes a personal image from the foreground, and in order to place it in the background image, it first finds the right place then masks the image and places it in the background, and then corrects the color and light also improves the image by a model.
    • Features
      • Depth Estimation
      • Image brightness correction & Color correction models
      • Object detection models to find humans and other objects in the Image.
      • Image Perspective calculation
      • Image blending
  • 2021
    OCR
    • Optical Character Recognition on German administrative forms.
    • Features
      • Use tesseract and self-trained text recognition neural network.
      • Use Clustering algorithms to find lines.
      • Use OpenCV and Image processing methods to find information blocks.

Interests

  • Guitar
  • Video Game
  • Ping Pong
  • Natrue Lover
  • Psychology & Philosophy