LIVE FOCUSLOCKED f/1.4 ISO100 ZOOM24mm
29.65°N 82.32°W --:--:--
01 · COMPUTER VISION RESEARCHER

Amir Etefaghi
Daryani

Ph.D. Candidate — Agricultural & Biological Engineering, University of Florida

I turn uncalibrated multi-view images into compact 3D worlds — query-based transformers and analytic superquadric primitive abstraction for spatial computing.

BASEDGainesville, FL
LABUF · Medeiros Lab
FOCUS3D Shape · Multi-View · Generative AI
AMIR_ETEFAGHI · 0.99
Portrait of Amir Etefaghi Daryani
SCROLL
A RIDE, ABSTRACTED — SCROLL TO CONTINUE
02 NOW — Current focus

What's running

Last calibrated: July 2026 · Gainesville, FL

As a Graduate Research Assistant working with Prof. Henry Medeiros at the University of Florida, I build systems that turn uncalibrated multi-view images directly into compact, interpretable 3D worlds — no camera calibration, no dense voxel grids, just analytic geometric primitives.

3D SHAPE · NEURIPS 2026

ViSQ

A query based multiview transformer utilizing a soft point to query assignment mechanism and a two stage decoder for predicting analytic superquadrics directly in world coordinates from RGB images.

OCCUPANCY · IN PROGRESS

SuperFormer

An end-to-end multi-view transformer for dense 3D semantic occupancy, replacing rigid voxel grids with interpretable superquadric primitive assemblies.

DETECTION · CVPR 2025

CaMuViD

Calibration-free multi-view object detection that fuses features across camera perspectives directly in image space — no bird's-eye view needed.

LAST MILESTONE: ViSQ submitted to NeurIPS 2026 — pending review
03 ABOUT — A brief dispatch

From pixels to primitives

Bridging 2D images and 3D spatial intelligence.

My research bridges 2D images and 3D spatial intelligence — designing query-based transformers and implicit neural decoders that reconstruct 3D shape and scene structure without relying on rigid camera calibration.

At the University of Florida, I work with Prof. Henry Medeiros on ViSQ and SuperFormer, frameworks that fuse self-supervised foundation models with geometric primitives to turn unstructured multi-view captures into compact, analytic 3D assemblies. Earlier work — CaMuViD and CLASP — focused on calibration-free multi-view detection and tracking; before that, I worked independently on generative facial synthesis and image forensics (E2F-GAN, IRL-Net).

Day to day that's Python and PyTorch, deformable attention, and a lot of thinking about how to represent the world with fewer, more meaningful numbers. I'm driven by turning complex, unstructured visual environments into clean 3D assets for the next generation of spatial computing and creative tools.

6PUBLISHED
PAPERS
3.95PH.D.
GPA
2CORE
PROJECTS
1NEURIPS
SUBMISSION
04 LOG — Research timeline

Frame by frame

A chronological capture of the journey so far.

2026
ViSQ submitted to NeurIPS
Vision-to-SuperQuadrics: a query-based multiview transformer for 3D shape abstraction, with Prof. Henry Medeiros.
FEB 2025
CaMuViD accepted to CVPR
Calibration-free multi-view detection work accepted at the Conference on Computer Vision and Pattern Recognition.
2023
Published in IJCB
Co-authored "Synthetic Face Generation via Eyes-to-Face Inpainting."
2023
Published in IEEE ACCESS
IRL-Net: inpainted region localization via spatial attention.
2023
Published in The Journal of Supercomputing
AdaInNet: an RL-based adaptive inference engine for distributed DNN offloading in IoT-FOG applications.
MAY 2023
Began the Ph.D. journey
Joined Prof. Henry Medeiros' lab at UF to start ViSQ, SuperFormer, CaMuViD, and CLASP.
JUL 2022
Defended master's thesis
Completed M.S. in Electrical Engineering at Amirkabir University of Technology, under Prof. Saeed Sharifian.
2022
Published in IEEE ACCESS
Co-authored E2FGAN: edge-aware coarse-to-fine GANs for facial inpainting.
05 PAPERS — Publications

Published work

NeurIPS · CVPR · IJCB · IEEE ACCESS · Journal of Supercomputing

01
ViSQ: Vision-to-SuperQuadrics Query-Based Multiview Transformer for 3D Shape Abstraction
with Prof. Henry Medeiros · Submitted to NeurIPS 2026
NEURIPS
02
CaMuViD: Calibration-Free Multi-View Detection
with Prof. Henry Medeiros · Conference on Computer Vision and Pattern Recognition (CVPR) · 2025
CVPR ↗
03
Synthetic Face Generation via Eyes-to-Face Inpainting
with A. Hassanpour et al. · IJCB · 2023
IJCB
04
IRL-Net: Inpainted Region Localization Network via Spatial Attention
IEEE ACCESS · 2023
ACCESS ↗
05
AdaInNet: RL-Based Adaptive Inference Engine for Distributed DNN Offloading in IoT-FOG Applications
with Prof. Saeed Sharifian · The Journal of Supercomputing · 2023
JoS ↗
06
E2FGAN: Edge-Aware Coarse-to-Fine GANs for Facial Inpainting
with A. Hassanpour et al. · IEEE ACCESS · 2022
ACCESS ↗
06 WORK — Projects in the field

Selected projects

From multi-view geometry to airport checkpoints.

3d shape · 0.98

ViSQ: Vision-to-SuperQuadrics

Query-based multiview transformer predicting analytic superquadric primitives directly in world coordinates from RGB images — no online 3D optimization needed.

PYTORCH · TRANSFORMERSSubmitted · NeurIPS 2026
occupancy · 0.94

SuperFormer: 3D Occupancy Abstraction

End-to-end multi-view transformer for dense 3D semantic occupancy, replacing rigid voxel grids with interpretable, analytic superquadric primitive assemblies.

PYTORCH · TRANSFORMERSIn progress
multi-view · 0.97CaMuViD project visualization

CaMuViD: Calibration-Free Multi-View Detection

Extends multi-view detection to work directly in each camera's image space, removing the need for calibration or a bird's-eye view while improving cross-view feature fusion.

PYTHON · PYTORCHLearn more →
tracking · 0.95CLASP project visualization

CLASP: Spatial-Temporal Instance Association

Real-time multi-object tracking pipeline that associates interacting people and belongings in high-density, heavily occluded airport video, funded by DHS S&T via ALERT.

PYTHON · PYTORCHIn progress
07 CONTACT — Get in touch

Let's talk about vision, robotics, or research.