01 · COMPUTER VISION RESEARCHER

Amir Etefaghi Daryani

Ph.D. Student — Agricultural & Biological Engineering, University of Florida

I turn multi-view images into compact 3D worlds — query-based transformers and analytic superquadric primitive abstraction for spatial computing.

BASED: Gainesville, FL

LAB: UF · Medeiros Lab

FOCUS: 3D Shape · Multi-View · Generative AI

02 NOW — Current focus

What's running

Last calibrated: July 2026 · Gainesville, FL

As a Graduate Research Assistant working with Prof. Henry Medeiros at the University of Florida, I build systems that turn multi-view images directly into compact, interpretable 3D worlds — no camera calibration, no dense voxel grids, just analytic geometric primitives.

3D SHAPE · IN PROGRESS

ViSQ

A query-based multi-view transformer that uses a soft point-to-query assignment mechanism and a two-stage decoder to predict analytic superquadrics directly in world coordinates from RGB images.
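For readers new to the primitive, here is a minimal sketch of the standard superquadric inside-outside function, which is what makes these shapes "analytic": a few numbers (three scales, two shape exponents) define a closed-form test for whether a point lies inside. This is the textbook formulation, not ViSQ's implementation; the function names are hypothetical, and the point is assumed to be already expressed in the primitive's own canonical frame (a world-coordinate version would first apply the primitive's rotation and translation).

```python
import math


def superquadric_inside_outside(point, scale, exponents):
    """Textbook superquadric inside-outside function F.

    point     -- (x, y, z) in the primitive's canonical frame (assumed)
    scale     -- (a1, a2, a3), positive semi-axis lengths
    exponents -- (e1, e2), positive shape exponents
                 (1, 1) gives an ellipsoid; values near 0 approach a box

    F < 1 means inside, F == 1 on the surface, F > 1 outside.
    """
    x, y, z = point
    a1, a2, a3 = scale
    e1, e2 = exponents
    # Absolute values keep fractional powers real for negative coordinates.
    xy_term = (abs(x / a1) ** (2.0 / e2) + abs(y / a2) ** (2.0 / e2)) ** (e2 / e1)
    z_term = abs(z / a3) ** (2.0 / e1)
    return xy_term + z_term


def is_inside(point, scale, exponents):
    """Hypothetical helper: True if the point is inside or on the surface."""
    return superquadric_inside_outside(point, scale, exponents) <= 1.0


if __name__ == "__main__":
    corner = (0.9, 0.9, 0.9)
    unit = (1.0, 1.0, 1.0)
    # The same point is outside a unit sphere but inside a box-like shape.
    print(is_inside(corner, unit, (1.0, 1.0)))
    print(is_inside(corner, unit, (0.1, 0.1)))
```

Changing only the two exponents moves the same primitive between sphere-like and box-like shapes, which is why a handful of parameters can stand in for a dense voxel grid.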

OCCUPANCY · IN PROGRESS

SuperFormer

An end-to-end multi-view transformer for dense 3D semantic occupancy, replacing rigid voxel grids with interpretable superquadric primitive assemblies.

DETECTION · CVPR 2025

CaMuViD

Calibration-free multi-view object detection that fuses features across camera perspectives directly in image space — no bird's-eye view needed.

LAST MILESTONE: ViSQ submitted to NeurIPS 2026 — pending review

03 ABOUT — A brief dispatch

From pixels to primitives

Bridging 2D images and 3D spatial intelligence.

My research bridges 2D images and 3D spatial intelligence — designing query-based transformers and implicit neural decoders that reconstruct 3D shape and scene structure without relying on rigid camera calibration.

At the University of Florida, I work with Prof. Henry Medeiros on ViSQ and SuperFormer, frameworks that fuse self-supervised foundation models with geometric primitives to turn unstructured multi-view captures into compact, analytic 3D assemblies. Earlier work — CaMuViD and CLASP — focused on calibration-free multi-view detection and tracking; before that, I worked independently on generative facial synthesis and image forensics (E2F-GAN, IRL-Net).

Day to day, that means Python and PyTorch, deformable attention, and a lot of thinking about how to represent the world with fewer, more meaningful numbers. I'm driven by turning complex, unstructured visual environments into clean 3D assets for the next generation of spatial computing and creative tools.

6 PUBLISHED PAPERS

3.95 PH.D. GPA

2 CORE PROJECTS

04 LOG — Research timeline

Frame by frame

A chronological capture of the journey so far.

FEB 2025

CaMuViD accepted to CVPR

Calibration-free multi-view detection work accepted at the Conference on Computer Vision and Pattern Recognition.

2023

Published in IJCB

Co-authored "Synthetic Face Generation via Eyes-to-Face Inpainting."

2023

Published in IEEE ACCESS

IRL-Net: inpainted region localization via spatial attention.

2023

Published in The Journal of Supercomputing

AdaInNet: an RL-based adaptive inference engine for distributed DNN offloading in IoT-FOG applications.

MAY 2023

Began the Ph.D. journey

Joined Prof. Henry Medeiros' lab at UF to start ViSQ, SuperFormer, CaMuViD, and CLASP.

JUL 2022

Defended master's thesis

Completed M.S. in Electrical Engineering at Amirkabir University of Technology, under Prof. Saeed Sharifian.

2022

Published in IEEE ACCESS

Co-authored E2F-GAN: edge-aware coarse-to-fine GANs for facial inpainting.

05 PAPERS — Publications

Published work

CVPR · IJCB · IEEE ACCESS · Journal of Supercomputing

CaMuViD: Calibration-Free Multi-View Detection

with Prof. Henry Medeiros · Conference on Computer Vision and Pattern Recognition (CVPR) · 2025

Synthetic Face Generation via Eyes-to-Face Inpainting

with A. Hassanpour et al. · IJCB · 2023

IRL-Net: Inpainted Region Localization Network via Spatial Attention

IEEE ACCESS · 2023

AdaInNet: RL-Based Adaptive Inference Engine for Distributed DNN Offloading in IoT-FOG Applications

with Prof. Saeed Sharifian · The Journal of Supercomputing · 2023

E2F-GAN: Edge-Aware Coarse-to-Fine GANs for Facial Inpainting

with A. Hassanpour et al. · IEEE ACCESS · 2022

06 WORK — Projects in the field

Selected projects

From multi-view geometry to airport checkpoints.

ViSQ: Vision-to-SuperQuadrics

Query-based multiview transformer predicting analytic superquadric primitives directly in world coordinates from RGB images — no online 3D optimization needed.

PYTORCH · TRANSFORMERS · IN PROGRESS

SuperFormer: 3D Occupancy Abstraction

End-to-end multi-view transformer for dense 3D semantic occupancy, replacing rigid voxel grids with interpretable, analytic superquadric primitive assemblies.

PYTORCH · TRANSFORMERS · IN PROGRESS

CaMuViD: Calibration-Free Multi-View Detection

Extends multi-view detection to work directly in each camera's image space, removing the need for calibration or a bird's-eye view while improving cross-view feature fusion.

PYTHON · PYTORCH

CLASP: Spatial-Temporal Instance Association

Real-time multi-object tracking pipeline that associates interacting people and belongings in high-density, heavily occluded airport video, funded by DHS S&T via ALERT.

PYTHON · PYTORCH · IN PROGRESS

07 CONTACT — Get in touch

Let's talk about vision, robotics, or research.

amir.etefaghidar@ufl.edu · Gainesville, FL
