*Result*: 蜂群多模态大数据自动采集管理系统设计与试验.

Title:
蜂群多模态大数据自动采集管理系统设计与试验.
Alternate Title:
Design and experiment of a multi-modal big data automatic collection and management system for bee colonies.
Authors:
刘继展1,2,3 liujizhan@163.com, 陈 耀1, 吴 硕1,2,3 2875514410@qq.com, 赵升燚1,2,3, 陆海燕4
Source:
Transactions of the Chinese Society of Agricultural Engineering. Feb2026, Vol. 42 Issue 4, p1-11. 11p.
Database:
Academic Search Index

*Further Information*

*A multimodal big data acquisition and management system is often required to reduce the reliance on manual inspection and visual annotation. This study aimed to design, implement and field-validate a cloud–edge–end coordinated system for the honeybee colonies. Non-intrusive, fully automatic, and long-term monitoring was realized on the colony behavior and environmental conditions with the reliable temporal alignment. The system consisted of an intelligent hive terminal and a cloud-based pre-annotation module. The hive terminal was integrated with a rail-mounted dual-box structure with automatic comb-shifting imaging, in order to sequentially capture the dense comb surfaces. A hive-entrance video was captured on the channel to constrain the flight trajectories, as well as the multi-point internal and external environmental sensing, including the temperature, humidity, carbon dioxide, total volatile organic compounds, particulate matter, and internal acoustic signals. A layered cloud–edge–end architecture was adopted to decouple the high-frequency video streams from the low-frequency sensor data, providing unified clock synchronization, local buffering, and stable data transmission under field network conditions. On the cloud side, a multimodal data processing was constructed to perform the timestamp alignment, automatic visual pre-annotation, and structured storage. The object detection and instance segmentation were combined with the time-series databases and relational metadata management, thereby enabling unified spatiotemporal indexing, cross-modal association, and efficient retrieval of the heterogeneous data types. A continuous 30-day field deployment was conducted at a commercial apiary in Jiangning District, Nanjing City, China. A systematic evaluation was carried out on the system stability, data accuracy, and annotation efficiency under unattended operating conditions. Stable operation was maintained after the deployment without manual intervention. The overall loss rate of the data packet remained below 6%, indicating the reliable long-term transmission of both video streams and environmental sensor data. The web interface supported five concurrent users for data browsing and downloading without observable blocking or performance degradation. Environmental measurements were recorded by the system, indicating the high consistency with the reference instruments. In external hive temperature, the mean absolute error was 0.3 °C, the root mean square error was 0.4 °C, and the Pearson correlation coefficient reached 0.995. The largest deviation of the measurement occurred in the concentration of the internal carbon dioxide, with a mean absolute error of 48 μmol/mol, a root mean square error of 62 μmol/mol, and a Pearson correlation coefficient of 0.937. Other environmental variables, including the internal temperature, humidity, and gas-related parameters, generally showed correlation coefficients above 0.98, fully meeting the accuracy requirements for the long-term apicultural monitoring and behavioral analysis. The cloud-based visual pre-annotation module was achieved in an average per-frame processing time of approximately 10 ms, covering the data loading, preprocessing, model inference, and data storage. In hive-entrance videos, the detection of the individual bees was achieved with an accuracy of 98%, thus enabling efficient extraction of the foraging activity information. In dense comb images, the instance segmentation was achieved with an average accuracy of 76% under frequent occlusion and adhesion, indicating the increasing difficulty in delineating the overlapping individuals on the crowded comb surfaces. Compared with manual labeling, the pipeline substantially improved the annotation efficiency, with sufficient accuracy for the population-level behavioral studies. Edge buffering and video compression were further compatible with the typical bandwidth of the field network, while preserving the analytical integrity of the frames and key feature segments. The system was realized for the continuous, non-intrusive acquisition, automatic pre-annotation and structured management of the multimodal honeybee colony data within a unified cloud–edge–end framework. Mechanical design, environmental sensing, and data-driven annotation were integrated to form a standardized data infrastructure in order to support the quantitative behavioral modeling and pollination efficiency assessment in intelligent apiculture. Future work can improve the instance segmentation in the densely populated comb scenes. The robustness and generalization can be enhanced to extend the long-term validation over multiple apiaries and ecological conditions. [ABSTRACT FROM AUTHOR]*

*为实现蜂群多模态数据的非侵入式、全周期自动采集与高效管理, 该研究设计了一种蜂群多模态大数据自动采 集管理系统。系统由智能蜂箱终端与自标注管理云端构成, 终端集成巢脾自动移位成像、出勤行为视频采集及多源环境 感知, 实现多模态数据的稳定获取; 云端采用多模态数据处理管道完成数据的时空对齐、自动标注与结构化存储。在南 京江宁区蜂场开展了为期 30 d 的连续田间试验, 对系统稳定性、数据准确性与标注效率进行验证。结果表明, 系统综合 数据丢包率低于 6%。环境参数采集精度较高, 其中蜂箱外温度测量误差最小, 平均绝对误差 (mean absolute error, MAE) 为 0.3 °C, 均方根误差 (root mean square error, RMSE) 为 0.4 °C, 皮尔逊相关系数 (pearson correlation coefficient, PCC) 为 0.995;蜂箱内 CO2 浓度测量误差相对较大, MAE 为 48 μmol/mol, RMSE 为 62 μmol/mol, PCC 为 0.937。自 标注模块单帧视觉数据处理时间约为 10 ms, 蜂箱口蜜蜂目标检测自标注准确率为 98%, 巢脾蜜蜂实例分割自标注平均 准确率为 76%。研究结果表明, 该系统可实现蜂群多模态数据的长期稳定获取与自动化管理, 为蜂群行为分析与智能养 殖提供可靠的数据支撑。 [ABSTRACT FROM AUTHOR]*