AgriChrono — Jaehwan (Jay) Jeong

TL;DR

we propose AgriChrono, a comprehensive framework purpose-built for "in-the-wild" scene-level 3D reconstruction, featuring three core contributions: a custom robotic platform, an 18 TB multi-modal dataset, and a stress-test benchmark. First, we deploy a modular robotic platform equipped with a highly accurate, time-synchronized sensor suite (LiDAR-RGB-Depth-IMU) to continuously log the high-fidelity 6-DoF poses and structural data required for reliable 3D reconstruction. Second, utilizing this platform, we release a massive dataset that rigorously captures continuous morphological crop growth and severe daily illumination shifts over a month. Finally, we establish a 7-scenario 3D reconstruction benchmark to evaluate 16 state-of-the-art NeRF and 3DGS methods. While validating our platform's data-gathering efficacy, this benchmark explicitly exposes critical robustness issues in existing baselines, demonstrating their failure to maintain consistent performance under wind-induced crop motion and lighting variations. Ultimately, AgriChrono provides a crucial stress-test to catalyze the development of robust, next-generation in-the-wild 3D reconstruction models.

Key Contributions

Modular Robotic Acquisition Platform — we develop a custom, remote-operable field robot equipped with a precise, time-synchronized multi-modal sensor suite (RGB, Depth, LiDAR, IMU, and Pose). This platform serves as a foundational tool, validating that high-fidelity 3D reconstruction is achievable in challenging, "in-the-wild" agricultural environments.
Large-Scale In-the-Wild Dataset — we release the comprehensive 18 TB AgriChrono dataset, uniquely capturing continuous morphological crop growth and systematic daily illumination changes. Collected daily over a one-month period, this dataset documents complex, scene-level structural changes to significantly accelerate real-world precision agriculture research.
3D Reconstruction Stress-Test Benchmark — we introduce a rigorous benchmark featuring 7 distinct scenarios to evaluate 16 state-of-the-art NeRF and 3DGS methods. This benchmark illustrates the significant challenges of non-rigid, dynamic scenes, explicitly exposing the fundamental robustness limitations of existing baselines against wind-induced motion and lighting shifts.

Project Design

1. Field & Remote Operation

This project was conducted across three distinct crop sites (Canola, Canola Genotypes, and Flax) at the North Dakota State University (NDSU) experimental farm. To enable agricultural researchers to operate the robot effortlessly without terminal commands, we developed a custom WebRTC-based low-latency Web UI using Flask and Janus. Initially, we intended to use a single ZED X camera for both data acquisition and monitoring. However, this approach presented several constraints: the ZED SDK's lack of simultaneous processing support, high latency in ROS implementation, storage capacity issues due to ROSBAG's low compression rate, and a fixed camera field-of-view (FoV). To resolve this, ZED data saving was restricted exclusively to its highly compressed .svo2 format, and an independent Obsbot PTZ camera was installed specifically for monitoring, thereby reducing system load and enhancing teleoperation convenience. (For an overview, see GitHub.)

2. Robot Platform Design

Built on a Scout UGV base, the platform integrates an NVIDIA Jetson AGX Orin, dual ZED X cameras, and Livox LiDAR. To meet varying sensor voltage needs from the robot's 24–27V output, we used DC-DC converters to supply 12V (LiDAR) and 18V (Jetson). This enabled over 4 hours of continuous operation per charge, supporting our rigorous daily schedule. Due to severe GPS interference from the nearby Hector International Airport, we omitted our planned RTK-GNSS module, pivoting to purely vision and IMU-based localization (AprilTags planned for future work).
(See Hardware GitHub.)

To manage the robot remotely nationwide, we used Tailscale VPN for static IP assignment, ensuring consistent connectivity. We also implemented a fail-safe mechanism that instantly halts the robot upon communication loss, prioritizing safe unattended operation. These optimizations enabled stable, long-range teleoperation from UCLA — 1,500 miles away — yielding a massive 18 TB dataset.
(See Software GitHub.)

Live video streaming from the field robot at NDSU to the remote operator at UCLA — 1,500 miles away.

3. Data Collection Protocol

AgriChrono data modalities and collection schedule

To document continuous morphological crop growth and natural illumination variations, data were systematically collected four times daily over a three-week period. To counteract extreme field temperatures exceeding 100°F, we installed a sun-shielded top panel for thermal protection. However, this passive cooling was not completely sufficient during prolonged operations, occasionally resulting in FPS degradation due to thermal throttling (as future work, we plan to upgrade the thermal management system, including improved active ventilation). Collection was suspended during severe weather to prevent hardware failure. When post-rain muddy soil rendered the paths impassable, we manually placed wooden planks along the traversal routes to secure necessary traction, allowing us to maintain a consistent data collection frequency. (See Dataset GitHub.)

RGB, Depth, and LiDAR data streaming from the field robot during data collection.

4. 3D Reconstruction Benchmark

To provide a highly reliable benchmark, we established initial geometries via point clouds and utilized Visual-Inertial Odometry (VIO) fusing RGB and IMU data, validated to yield the highest pose accuracy through rigorous comparative experiments. Combining these poses with the RGB images, we constructed and publicly released a 3D reconstruction benchmark in COLMAP format, featuring seven distinct scenarios. Our analysis revealed that under favorable conditions — stable lighting and dense canopy structures — state-of-the-art models achieved high accuracy, effectively validating our sensor pipeline. Conversely, under harsher scenarios (e.g., intense illumination, sparse crops, wind-induced motion), baseline performance fluctuated significantly, demonstrating that existing AI models remain highly vulnerable to unconstrained, "in-the-wild" environments.
(Dataset download: Benchmark GitHub · Leaderboard: Benchmark Page)

Novel View Synthesis Benchmark

Method	Lighting Variance (Day 19)								Growth Span (6 AM)
	06:00 AM		11:00 AM		04:00 PM		09:00 PM		Day 6		Day 13		Day 20
	PSNR ↑	SSIM ↑	PSNR ↑	SSIM ↑	PSNR ↑	SSIM ↑	PSNR ↑	SSIM ↑	PSNR ↑	SSIM ↑	PSNR ↑	SSIM ↑	PSNR ↑	SSIM ↑
3DGUT	26.433	0.6915	22.860	0.5488	24.147	0.5906	27.813	0.7134	23.706	0.5130	24.856	0.5597	25.858	0.6477
Octree-GS	28.451	0.8009	24.402	0.7044	25.593	0.7269	29.716	0.8092	26.211	0.7098	27.669	0.7545	27.677	0.7771
gsplat	27.242	0.7637	23.740	0.6630	25.012	0.6882	28.714	0.7827	25.431	0.6348	26.730	0.6977	26.922	0.7355
3D-MCMC	27.053	0.7194	23.385	0.5886	24.667	0.6281	28.180	0.7370	24.556	0.5566	26.134	0.6349	26.357	0.6795
WildGaussians	7.298	0.4037	11.498	0.3847	11.022	0.4061	8.158	0.4506	23.057	0.5129	24.422	0.5812	11.655	0.5140
GS-W	20.338	0.6593	17.946	0.5033	20.337	0.5588	22.819	0.7015	23.474	0.5186	23.616	0.5885	19.967	0.5889
H3DGS	27.990	0.7939	24.412	0.7076	25.488	0.7204	29.330	0.8042	25.473	0.6680	27.260	0.7390	27.350	0.7685
3DGRT	26.702	0.7653	23.599	0.6616	24.733	0.6896	28.463	0.7828	25.317	0.6351	26.424	0.6986	26.301	0.7373
Taming3DGS	27.908	0.7756	24.342	0.6854	25.453	0.7012	29.179	0.7896	25.780	0.6533	27.241	0.7187	27.534	0.7495
Scaffold-GS	25.542	0.7403	21.193	0.6280	24.661	0.6739	27.770	0.7743	24.816	0.6427	25.286	0.6887	24.612	0.7032
PGSR	26.697	0.7429	23.673	0.6415	24.634	0.6610	28.277	0.7585	24.148	0.5459	25.236	0.6271	26.607	0.7122
3DGS	27.107	0.7717	23.950	0.6742	24.970	0.6932	28.788	0.7884	25.449	0.6449	26.891	0.7097	26.987	0.7420
Zip-NeRF	29.582	0.7953	25.923	0.7562	27.466	0.7663	30.675	0.8065	25.943	0.6097	28.844	0.7742	29.418	0.8105
Tetra-NeRF	28.225	0.7333	24.057	0.6068	25.564	0.6458	29.337	0.7558	25.324	0.5976	27.096	0.6581	27.388	0.7020
NerfStudio	23.703	0.6389	20.275	0.4353	21.573	0.5007	24.739	0.6555	22.133	0.4449	18.809	0.4454	23.194	0.5809
SeaThru-NeRF	25.371	0.6274	21.421	0.4415	22.885	0.4975	26.456	0.6527	22.691	0.4429	21.488	0.4601	24.084	0.5681

Novel view synthesis benchmark on Site 1. Bold = best, underline = second best per column.

AgriChrono qualitative reconstruction results

Qualitative comparison of novel view synthesis results across evaluated methods.

Conclusion

We present AgriChrono, a comprehensive field robotics framework whose benchmark results serve a dual purpose. First, they validate our robotic platform's design, proving it successfully captures the high-fidelity data required for reliable 3D reconstruction in complex agricultural fields. Second, by explicitly exposing the performance degradation of current baselines under dynamic "in-the-wild" stressors, our benchmark provides a vital stress-test for the AI community. This research acts as a critical stepping stone, catalyzing the development of more robust models capable of unconstrained scene understanding — eventually bridging the gap to extract concrete downstream phenotyping metrics.

Acknowledgement

NDSU

Experimental farm facilities, on-site data collection support, and agricultural domain expertise provided by North Dakota State University (NDSU).

This work was supported in part by the U.S. Department of Agriculture (Grant No. 2024-67021-42528 and 2022-67022-37021), the Korea Creative Content Agency (KOCCA) under Grant RS-2024-00345025, and the Institute of Information & Communications Technology Planning & Evaluation (IITP) funded by the Korean government (MSIT) under Grant No. RS-2019-II190079.