[논문 리뷰] Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species
TPC–268은 전체 분류 계층을 통합한 최초의 대규모 식물 개체 수 벤치마크로, 다중 스케일 식물 이미징에서 분류학 인지, 계급 무관한 개체 수 산출을 가능하게 한다.
Visually cataloging and quantifying the natural world requires pushing the boundaries of both detailed visual classification and counting at scale. Despite significant progress, particularly in crowd and traffic analysis, the fine-grained, taxonomy-aware plant counting remains underexplored in vision. In contrast to crowds, plants exhibit nonrigid morphologies and physical appearance variations across growth stages and environments. To fill this gap, we present TPC-268, the first plant counting benchmark incorporating plant taxonomy. Our dataset couples instance-level point annotations with Linnaean labels (kingdom -> species) and organ categories, enabling hierarchical reasoning and species-aware evaluation. The dataset features 10,000 images with 678,050 point annotations, includes 268 countable plant categories over 242 plant species in Plantae and Fungi, and spans observation scales from canopy-level remote sensing imagery to tissue-level microscopy. We follow the problem setting of class-agnostic counting (CAC), provide taxonomy-consistent, scale-aware data splits, and benchmark state-of-the-art regression- and detection-based CAC approaches. By capturing the biodiversity, hierarchical structure, and multi-scale nature of botanical and mycological taxa, TPC-268 provides a biologically grounded testbed to advance fine-grained class-agnostic counting. Dataset and code are available at https://github.com/tiny-smart/TPC-268.
연구 동기 및 목표
- Motivate counting in the plant domain as a fine-grained, taxonomy-aware problem distinct from traditional crowd/vehicle counting.
- Introduce TPC–268, a large-scale dataset with taxonomic labels and multi-scale imagery for robust counting across growth stages and environments.
- Enable hierarchical reasoning by annotating instances with Linnaean taxonomy (kingdom to species) and organ categories.
- Provide taxonomy-consistent data splits to rigorously evaluate generalization to unseen species within taxonomic gaps.
제안 방법
- Define Pseudo-class-agnostic counting (CAC) for plants and build a dataset that ties counting instances to hierarchical taxonomy.
- Annotate 10,000 images with 678,050 points and 30,000 bounding boxes across 242 species, organized into 268 countable categories.
- Provide 7-dimensional taxonomic vectors per species and auxiliary organ metadata to enable multi-level reasoning.
- Partition data with an MILP-based scheme to ensure taxonomic independence and balanced density across train/val/test sets.
- Benchmark state-of-the-art CAC approaches (regression-based and detection-based) on the new dataset to study generalization across taxonomic and scale variations.
- Explore incorporation of taxonomy information (text prompts) into counting models to assess inductive biases from biological structure.
실험 결과
연구 질문
- RQ1Can existing class-agnostic counting (CAC) models generalize to hundreds of fine-grained plant species when evaluation respects taxonomic hierarchy?
- RQ2How does incorporating taxonomic and organ-level information affect counting accuracy across scales and densities?
- RQ3What are the relative strengths of regression-based versus detection-based CAC methods for dense, structurally entangled plant imagery?
- RQ4Does cross-dataset transfer from generic object counting datasets to plant counting degrade performance, and can plant-specific taxonomy improve robustness?
- RQ5To what extent do taxonomic priors enable zero-shot or few-shot generalization across related plant taxa?
주요 결과
| 방법 | 저널/학회 및 연도 | 백본 네트워크 | 샷 | 검증 MAE | 검증 RMSE | 검증 R^2 | 테스트 MAE | 테스트 RMSE | 테스트 R^2 |
|---|---|---|---|---|---|---|---|---|---|
| FamNet | CVPR’21 | R50 | 3 | 28.87 | 52.51 | 0.58 | 30.43 | 65.62 | 0.62 |
| BMNet+ | CVPR’22 | R50 | 3 | 29.33 | 77.78 | 0.47 | 27.78 | 57.25 | 0.50 |
| C-DETR | ECCV’22 | R50 | 3 | 22.66 | 77.51 | 0.75 | 22.68 | 57.97 | 0.74 |
| SPDCNet | BMVC’22 | R18 | 3 | 25.66 | 72.49 | 0.52 | 23.70 | 47.53 | 0.64 |
| CountTR | BMVC’22 | Hybrid | 3 | 20.21 | 55.82 | 0.73 | 25.19 | 49.94 | 0.62 |
| SAFECount | WACV’23 | R18 | 3 | 22.57 | 63.65 | 0.64 | 25.70 | 52.30 | 0.58 |
| LOCA | ICCV’23 | R50 | 3 | 17.26 | 53.19 | 0.75 | 17.51 | 38.37 | 0.78 |
| DAVE | CVPR’24 | R50 | 3 | 16.47 | 52.87 | 0.76 | 17.61 | 40.06 | 0.75 |
| CACViT | AAAI’24 | ViT-B | 3 | 16.63 | 42.49 | 0.82 | 22.04 | 41.79 | 0.73 |
| CountGD | NeurIPS’24 | SWin-B | 3 | 18.32 | 54.55 | 0.74 | 19.52 | 50.51 | 0.61 |
| TasselNetV4 | ISPRS’26 | ViT-B | 3 | 13.20 | 43.93 | 0.83 | 22.95 | 51.36 | 0.60 |
- TPC–268 contains 10,000 images with 678,050 points and 30,000 bounding boxes across 242 species and 268 categories.
- Taxonomy-aware splits (species-organization level) enable rigorous zero-shot counting evaluation across taxonomic gaps.
- Regression-based CAC models generally outperform detection-based approaches on this dataset, with LOCA achieving the best test performance among regulators.
- Incorporating taxonomic information as textual prompts or hierarchical taxonomy improves counting performance, evidencing a practical inductive bias from biological structure.
- Cross-dataset transfer reveals that plant counting is more challenging when trained on FSC–147 and tested on TPC–268, while training on TPC–268 better generalizes to FSC–147 than the opposite direction.
- Fine-grained analyses show counting difficulty is driven by taxonomic and morphological complexity (e.g., Brassicaceae and Poaceae) and scales (microscopic vs macroscopic), not just data quantity.
더 나은 연구,지금 바로 시작하세요
연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.
카드 등록 없음 · 무료 플랜 제공
이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.