Skip to content
View halim7youcef's full-sized avatar
  • Algeria , Biskra

Block or report halim7youcef

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
halim7youcef/README.md

Typing SVG


About Me

researcher = {
    "name"       : "Youcef Abdelhalim",
    "degree"     : "MSc Artificial Intelligence (M2) β€” Mohamed Khider University, Biskra",
    "focus"      : ["Medical Imaging", "Semantic & Instance Segmentation",
                    "Explainable AI (XAI)", "Multimodal Perception"],
    "publication": "ICECET 2026 β€” Post-Hoc Explainability for Generative Volumetric VLMs",
    "club"       : "Debug Scientific Club",
    "location"   : "Biskra, Algeria πŸ‡©πŸ‡Ώ",
}

I build interpretable, high-performance vision systems β€” from low-level CUDA kernels to multimodal RAG pipelines.
My research sits at the intersection of deep learning transparency and clinical relevance, with a focus on making model decisions trustworthy in medical contexts.


Highlights

πŸ₯‡ National Winner β€” Huawei ICT Competition, Cloud Track (2025)
πŸ₯‰ 3rd Place β€” National AI Hackathon, University of El Oued (2025)
πŸ“„ First-Author Paper accepted @ ICECET 2026, Rome, Italy
πŸ›°οΈ Team Lead β€” NASA Space Apps Challenge (2025)
πŸ₯‰ 3rd Place β€” CSBIS AI Competition, University of Mohamed Khider (2023)

πŸ”¬ Featured Projects

CT & MRI Multimodal Explainability for Generative Volumetric Models

Tied to the ICECET 2026 accepted paper.

  • Adapted post-hoc XAI techniques to generative volumetric multimodal models
  • Developed modality-specific attribution alignment and volumetric saliency aggregation across slices
  • Implemented gradient-based & perturbation-based attributions tailored to generative outputs
  • Consistency regularization across neighboring slices + anatomical overlay visualizations
  • Reproducible evaluation: quantitative faithfulness/localization metrics + expert qualitative assessment
  • Stack: PyTorch Β· Captum Β· NiBabel Β· SimpleITK
Prompt-Guided Image Segmentation (PromptSeg)
  • Lightweight multimodal framework generating pixel-accurate masks from free-text prompts
  • Fused frozen DINOv2 visual features with CLIP text embeddings via trainable SAM-based decoder (~9.3M params)
  • Trained on RefCOCO with multi-scale FPN fusion β†’ best validation IoU: 0.42
  • Documented failure modes; proposed LoRA fine-tuning + token-level text fusion improvements
CuVision Engine β€” Native C++/CUDA CV Framework
  • Low-latency CV framework targeting cuDNN & cuBLAS primitives directly β€” zero framework overhead
  • 20 custom CUDA kernels: classification, RetinaNet-based detection (FPN), Attention U-Net (ASPP)
  • Designed for edge deployment on NVIDIA Jetson hardware
Multimodal RAG Platform
  • Modular RAG pipeline over text + images + audio
  • Integrated Florence-2 (captioning) Β· Whisper (transcription) Β· DocLayout-YOLO (parsing)
  • E5-small-v2 embeddings + FAISS retrieval β†’ 94% faithfulness, 250ms avg latency
  • Deployed via Docker with MLflow tracking + LLM-as-a-judge evaluation suite

Tech Stack

Core
Python C++ CUDA

Deep Learning
PyTorch HuggingFace OpenCV

MLOps & Deployment
Docker FastAPI MLflow

Research Tools
FAISS NumPy


Research Interests

Computer Vision  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ  Semantic / Instance Segmentation / Object Detection / Classification
Medical Imaging  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘  3D Volumetric Analysis
Explainable AI   β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘  Grad-CAM Β· IG Β· Guided Backprop 
Multimodal AI    β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘  Vision-Language Β· Vision Large-Language
Edge Systems     β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘  CUDA Β· cuDNN Β· Jetson Deployment

Connect

Email Kaggle


GitHub Stats

GitHub Streak


Pinned Loading

  1. Perceptra_RT Perceptra_RT Public

    Real-time AV perception, object detection, multi-object tracking, monocular depth, sensor fusion and semantic segmentation on urban driving scenes.

    Python

  2. CuVision-Engine CuVision-Engine Public

    A low-latency computer vision framework written entirely in native C++/CUDA, engineered for maximum throughput on NVIDIA hardware with optimized implementations for classification, object detection…

    Cuda

  3. PromptSeg-Lightweight PromptSeg-Lightweight Public

    A modular multimodal framework that generates object masks from text prompts using a lightweight cross-modal decoder to fuse features from interchangeable vision and language backbones.

    Python

  4. Multimodal_RAG_Platform Multimodal_RAG_Platform Public

    A NotebookLM-inspired multimodal RAG platform for ingesting, encoding, and reasoning over text, images, and video with traceable, source-grounded generation.

    Python