QUICK REVIEW

[论文解读] Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species

Jinyu Xu, Tianqi Hu|arXiv (Cornell University)|Mar 22, 2026

Smart Agriculture and AI被引用 0

一句话总结

TPC–268 是首个将完整分类层级整合进来的大规模植物计数基准，能够在多尺度植物影像中实现 taxonomy-aware、类无关的计数。

ABSTRACT

Visually cataloging and quantifying the natural world requires pushing the boundaries of both detailed visual classification and counting at scale. Despite significant progress, particularly in crowd and traffic analysis, the fine-grained, taxonomy-aware plant counting remains underexplored in vision. In contrast to crowds, plants exhibit nonrigid morphologies and physical appearance variations across growth stages and environments. To fill this gap, we present TPC-268, the first plant counting benchmark incorporating plant taxonomy. Our dataset couples instance-level point annotations with Linnaean labels (kingdom -> species) and organ categories, enabling hierarchical reasoning and species-aware evaluation. The dataset features 10,000 images with 678,050 point annotations, includes 268 countable plant categories over 242 plant species in Plantae and Fungi, and spans observation scales from canopy-level remote sensing imagery to tissue-level microscopy. We follow the problem setting of class-agnostic counting (CAC), provide taxonomy-consistent, scale-aware data splits, and benchmark state-of-the-art regression- and detection-based CAC approaches. By capturing the biodiversity, hierarchical structure, and multi-scale nature of botanical and mycological taxa, TPC-268 provides a biologically grounded testbed to advance fine-grained class-agnostic counting. Dataset and code are available at https://github.com/tiny-smart/TPC-268.

研究动机与目标

Motivate counting in the plant domain as a fine-grained, taxonomy-aware problem distinct from traditional crowd/vehicle counting.
Introduce TPC–268, a large-scale dataset with taxonomic labels and multi-scale imagery for robust counting across growth stages and environments.
Enable hierarchical reasoning by annotating instances with Linnaean taxonomy (kingdom to species) and organ categories.
Provide taxonomy-consistent data splits to rigorously evaluate generalization to unseen species within taxonomic gaps.

提出的方法

Define Pseudo-class-agnostic counting (CAC) for plants and build a dataset that ties counting instances to hierarchical taxonomy.
Annotate 10,000 images with 678,050 points and 30,000 bounding boxes across 242 species, organized into 268 countable categories.
Provide 7-dimensional taxonomic vectors per species and auxiliary organ metadata to enable multi-level reasoning.
Partition data with an MILP-based scheme to ensure taxonomic independence and balanced density across train/val/test sets.
Benchmark state-of-the-art CAC approaches (regression-based and detection-based) on the new dataset to study generalization across taxonomic and scale variations.
Explore incorporation of taxonomy information (text prompts) into counting models to assess inductive biases from biological structure.

实验结果

研究问题

RQ1Can existing class-agnostic counting (CAC) models generalize to hundreds of fine-grained plant species when evaluation respects taxonomic hierarchy?
RQ2How does incorporating taxonomic and organ-level information affect counting accuracy across scales and densities?
RQ3What are the relative strengths of regression-based versus detection-based CAC methods for dense, structurally entangled plant imagery?
RQ4Does cross-dataset transfer from generic object counting datasets to plant counting degrade performance, and can plant-specific taxonomy improve robustness?
RQ5To what extent do taxonomic priors enable zero-shot or few-shot generalization across related plant taxa?

主要发现

Method	Venue & Year	Backbone	Shot	Val MAE	Val RMSE	Val R^2	Test MAE	Test RMSE	Test R^2
FamNet	CVPR’21	R50	3	28.87	52.51	0.58	30.43	65.62	0.62
BMNet+	CVPR’22	R50	3	29.33	77.78	0.47	27.78	57.25	0.50
C-DETR	ECCV’22	R50	3	22.66	77.51	0.75	22.68	57.97	0.74
SPDCNet	BMVC’22	R18	3	25.66	72.49	0.52	23.70	47.53	0.64
CountTR	BMVC’22	Hybrid	3	20.21	55.82	0.73	25.19	49.94	0.62
SAFECount	WACV’23	R18	3	22.57	63.65	0.64	25.70	52.30	0.58
LOCA	ICCV’23	R50	3	17.26	53.19	0.75	17.51	38.37	0.78
DAVE	CVPR’24	R50	3	16.47	52.87	0.76	17.61	40.06	0.75
CACViT	AAAI’24	ViT-B	3	16.63	42.49	0.82	22.04	41.79	0.73
CountGD	NeurIPS’24	SWin-B	3	18.32	54.55	0.74	19.52	50.51	0.61
TasselNetV4	ISPRS’26	ViT-B	3	13.20	43.93	0.83	22.95	51.36	0.60

TPC–268 contains 10,000 images with 678,050 points and 30,000 bounding boxes across 242 species and 268 categories.
Taxonomy-aware splits (species-organization level) enable rigorous zero-shot counting evaluation across taxonomic gaps.
Regression-based CAC models generally outperform detection-based approaches on this dataset, with LOCA achieving the best test performance among regulators.
Incorporating taxonomic information as textual prompts or hierarchical taxonomy improves counting performance, evidencing a practical inductive bias from biological structure.
Cross-dataset transfer reveals that plant counting is more challenging when trained on FSC–147 and tested on TPC–268, while training on TPC–268 better generalizes to FSC–147 than the opposite direction.
Fine-grained analyses show counting difficulty is driven by taxonomic and morphological complexity (e.g., Brassicaceae and Poaceae) and scales (microscopic vs macroscopic), not just data quantity.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。