Traditional lost and found systems suffer from inefficient manual processes, limited visibility, and lack of intelligent matching capabilities. This paper presents FoundIT, a novel mobile application that leverages artificial intelligence and microservices architecture to revolutionize item recovery processes. The system integrates computer vision through CLIP embeddings, vector similarity search using Qdrant, graph-based relationship modeling with Neo4j, and behavioral anomaly detection to create an intelligent, scalable platform. Our implementation demonstrates significant improvements in matching accuracy through multi-modal AI processing while maintaining system resilience through decoupled microservices. The architecture supports real-time notifications, semantic search capabilities, and automated fraud detection, addressing key limitations of existing solutions.
Keywords: Microservices, Computer Vision, Vector Database, CLIP Embeddings, Lost and Found Systems, Mobile Applications, Anomaly Detection
The proliferation of personal belongings in public spaces has created an urgent need for efficient item recovery systems. Traditional lost and found approaches, typically dependent on physical locations and manual documentation, prove inadequate for modern urban environments. Recent advances in artificial intelligence, particularly in computer vision and natural language processing, present opportunities to transform this domain through intelligent automation.
This paper introduces FoundIT, a mobile application that addresses these challenges through a sophisticated microservices architecture enhanced with AI capabilities. The system combines multi-modal embeddings, vector similarity search, and behavioral analysis to create an intelligent platform for item recovery. Our contribution lies in the novel integration of CLIP embeddings with vector databases for semantic similarity matching, coupled with a distributed microservices architecture that ensures scalability and maintainability.
Current platforms exhibit several critical shortcomings:
Analysis of existing solutions reveals significant technical limitations:
FoundIT employs a distributed microservices architecture based on the following principles:
The architecture comprises seven primary components:
Cross-platform mobile application providing user interfaces for item reporting, searching, and communication. Integrates Firebase SDK for real-time data synchronization and authentication.
Central orchestration platform managing:
FastAPI-based service functioning as the communication bridge between Firebase and specialized microservices. Handles request routing, data transformation, and response integration while publishing messages to RabbitMQ for asynchronous processing.
Facilitates decoupled communication through:
AI-powered service implementing multi-modal similarity matching through:
Behavioral analysis service employing:
Azure Blob Storage integration for scalable handling of images and documents with optimized access patterns and security controls.
The core innovation of FoundIT lies in its sophisticated similarity matching system that combines computer vision and natural language processing:
The system utilizes OpenAI's CLIP (Contrastive Language-Image Pre-training) model to create unified representations:
Embedding Process:
1. Text Processing: Item descriptions → CLIP text encoder → 512D vector
2. Image Processing: Item photos → CLIP vision encoder → 512D vector
3. Joint Representation: Combined embedding in shared vector space
4. Normalization: L2 normalization for cosine similarity computation
Qdrant serves as the high-performance vector storage and retrieval system:
Technical Specifications:
Storage Strategy:
Vector Entry Structure:
{
"id": "unique_post_identifier",
"vector": [512-dimensional CLIP embedding],
"metadata": {
"category": "electronics|clothing|accessories|documents",
"location": {"lat": float, "lng": float, "radius": int},
"timestamp": "ISO datetime",
"post_type": "lost|found",
"user_id": "unique_user_identifier"
}
}
The similarity search process implements a multi-stage approach:
Neo4j complements the vector database by storing semantic relationships:
Graph Schema:
- Nodes: POST {id, type, category, location, timestamp}
- Edges: SIMILAR_TO {similarity_score, computed_at, algorithm_version}
- Indexes: ON POST.id, POST.category, POST.location
This dual-storage approach enables:
The system implements several optimization techniques:
The suspicious user detection service implements a sophisticated anomaly detection system:
Frequency Analysis:
Duplicate Detection:
Content Analysis:
Integration of LLaMA 3 via Groq inference for nuanced content evaluation:
LLM Assessment Process:
1. Content Preprocessing: Extract text, metadata, and context
2. Prompt Engineering: Structured analysis request with examples
3. JSON Schema Validation: Ensure reliable automated decision-making
4. Risk Scoring: Combine rule-based and LLM-generated scores
5. Alert Generation: Threshold-based admin notifications
The system leverages Redis for efficient state tracking:
Frontend:
Backend Services:
AI/ML Components:
Data Storage:
Infrastructure:
Similarity Search Performance:
System Scalability:
Testing Scope: The system has been validated through controlled testing with a limited dataset of sample images and posts to demonstrate core functionality and architectural feasibility.
Functional Validation:
Architectural Verification:
Component Integration:
Security Feature Validation:
Measured Performance Metrics:
Scalability Design Validation:
The microservices architecture provides several advantages:
Qdrant's integration offers significant benefits:
Technical Improvements:
Feature Expansion:
FoundIT demonstrates the successful integration of modern AI technologies with microservices architecture to solve a real-world problem. The system's novel approach to multi-modal similarity search, combined with vector database technology and behavioral anomaly detection, creates a comprehensive solution that significantly outperforms traditional approaches.
The architecture's modularity ensures scalability and maintainability while the AI integration provides intelligent automation that enhances user experience. Performance evaluation demonstrates the system's effectiveness with high accuracy rates and strong scalability characteristics.
This work contributes to the growing field of AI-powered microservices applications and provides a foundation for future research in intelligent item recovery systems. The open-source availability of the project enables community collaboration and further innovation in this domain.
For comprehensive technical implementation details, architecture diagrams, and in-depth analysis of each system component, readers are encouraged to consult the complete technical report accompanying this publication. The detailed report provides extensive coverage of implementation specifics, configuration parameters, and deployment considerations that complement the high-level overview presented in this paper.
Additionally, the complete project presentation, source code, and documentation are available through the project repository. Readers interested in practical implementation details, code examples, or system deployment are welcome to explore these resources or contact any team member for technical discussions and collaboration opportunities.
The authors thank the National Institute of Applied Science and Technology (INSAT) for providing the academic framework and resources for this research. Special appreciation goes to Mrs. Hajer Taktak for her supervision and guidance throughout the project development.
[1] Radford, A., et al. "Learning Transferable Visual Models From Natural Language Supervision." International Conference on Machine Learning, 2021.
[2] Johnson, J., Douze, M., & Jégou, H. "Billion-scale similarity search with GPUs." IEEE Transactions on Big Data, 2019.
[3] Malkov, Y. A., & Yashunin, D. A. "Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs." IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018.
[4] Newman, M. "Networks: An Introduction." Oxford University Press, 2018.
[5] Chen, T., et al. "MicroServices: A Survey." IEEE Access, 2018.
[6] Touvron, H., et al. "LLaMA: Open and Efficient Foundation Language Models." arXiv preprint arXiv:2302.13971, 2023.
[7] Zhang, Y., et al. "Vector Database Systems: Concepts, Techniques, and Applications." ACM Computing Surveys, 2023.
Authors:
Contact Information:
For technical inquiries, collaboration opportunities, or detailed discussions about the implementation, feel free to contact any team member through their LinkedIn profiles or via the project repository.
GitHub Repository: https://github.com/AchrefHemissi/FoundIT-Computer-Vision-Powered-Lost-and-Found-Mobile-Application