Implementasi AI Object Recognition Real-Time Menggu- nakan TensorFlow.js dan Integrasi WhatsApp
Keywords:
Object Detection, Object Recognition, Overlap Logic, tensorflow.js, WhatsApp BotAbstract
Real-time monitoring system development frequently encounters accessibility challenges when deployed on mid-to-low-end hardware. This study documents the experimental process of developing a web-based AI object recognition system utilizing TensorFlow.js and the COCO-SSD model. Prior research employing COCO-SSD has demonstrated suboptimal performance, with response times exceeding 33 ms. During the development phase, custom logic incorporating overlap mechanisms and cooldown features was implemented to address limitations inherent in basic object detection when recognizing human-object interactions. To optimize real-time performance, this logic was applied at the application level rather than within the AI model itself, leveraging the latest deep learning methodologies proven to outperform YOLO. Using a private training dataset comprising limited facial and indoor object images, the system successfully visualizes bounding boxes and sends instant WhatsApp alerts via whatsapp-web.js. The methodology adheres to an integrated web-based object detection workflow. Experimental results demonstrate a responsive system with latency below 30 ms, meeting real-time performance standards. This paper concludes that the JavaScript-based AI stack, combined with spatial logic, effectively provides a functional solution for automatic activity recognition.
Downloads
References
[1] N. Carion, F. Massa, G. Synnaeve, N. Usunier, and A. Kirillov, “End-to-End Object Detection with Transformers,” pp. 1–17, 2020.
[2] C. Wang, A. Bochkovskiy, and H. M. Liao, “YOLOv7 : Trainable bag-of-freebies sets new state- of-the-art for real-time object detectors,” pp. 7464–7475, 2023.
[3] D. Smilkov et al., “TensorFlow.js: Machine Learning for the Web and Beyond,” Proc. Mach. Learn. Syst., 2019.
[4] M. Viola, P.; Jones, “Rapid Object Detection Using a Boosted Cascade of Simple Features,” Mit- subishi Electr., 2001.
[5] S. Ren, K. He, and R. Girshick, “Faster R-CNN : Towards Real-Time Object Detection with Re- gion Proposal Networks,” pp. 1–9, 2017.
[6] W. L. B et al., “SSD : Single Shot MultiBox Detector,” vol. 1, pp. 21–37, 2016, doi: 10.1007/978-3-319-46448-0.
[7] K. Manji, J. Hanefeld, T. De Gruchy, J. Vearey, and H. Walls, “Using WhatsApp messenger for health systems research : a scoping review of available literature,” no. April, pp. 774–789, 2021, doi: 10.1093/heapol/czab024.
[8] M. Tan, R. Pang, and Q. V Le, “EfficientDet : Scalable and Efficient Object Detection,” pp. 10781–10790, 2020.
[9] C. Wang, A. Bochkovskiy, and H. M. Liao, “Scaled-YOLOv4 : Scaling Cross Stage Partial Net- work,” pp. 13029–13038, 2021.
[10] M. Kotthapalli and D. Ravipati, “YOLOv1 to YOLOv11 : A Comprehensive Survey of Real- Time Object Detection Innovations and Challenges,” pp. 1–13, 2025.
[11] D. Smilkov et al., “TensorFlow.js: Machine Learning for the Web and Beyond,” 2019, [Online]. Available: http://arxiv.org/abs/1901.05350
[12] J. Redmon and A. Farhadi, “YOLOv3: An Incremental Improvement,” 2018, [Online]. Available: http://arxiv.org/abs/1804.02767
[13] A. G. Howard et al., “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications,” 2017, [Online]. Available: http://arxiv.org/abs/1704.04861
[14] M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L. C. Chen, “MobileNetV2: Inverted Resid- uals and Linear Bottlenecks,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 4510–4520, 2018, doi: 10.1109/CVPR.2018.00474.
[15] S. S. S. R. Depuru, L. Wang, V. Devabhaktuni, and N. Gudi, “Smart meters for power grid - Challenges, issues, advantages and status,” 2011 IEEE/PES Power Syst. Conf. Expo. PSCE 2011, pp. 1–7, 2011, doi: 10.1109/PSCE.2011.5772451.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Agung Setyadi, Adhy Rizaldy, Atika Reski, Muhammad Al Fauzan Bobihu

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.
