Publications
Create newSort
Creating a Spherical Panoramic Image Generator Using SIFT
- k
- t
21 reads- Computer Vision
- Image Stitching
- +2
Intelligent Image-to-Text Extraction API Powered by Azure OCR and FastAPI
21 reads- Azure Cognitive Services
- Azure OCR
- +6
ClearPath: Enhancing Obstacle Detection for Autonomous Systems
- c
21 reads- android-app
- autonomous-systems
- +8
Sanjeevan - video calling web application for people with speaking disability
21 reads- fastapi
- mediapipe
- +2
Automated Research Paper to Podcast Conversion using LLM
9 reads- ⏱️ Time Efficiency
- 🌍 Multilingual Support (Future)
- +9
"Text-to-Image Generation Using Stable Diffusion 2"
- h
21 reads- #AI
- #ArtificialIntelligence
- +16
WikiLikeSearchLLMAgent
- s
15 reads- Agents
- GPT
- +1
DataFlowAI-Powered Data Engineering Solutions
- j
9 reads- Data Engineering
Fine Tuning Of An LLM To Aid In Symptom Diagnosis
- r
5 reads- LLM
Building a Time-Based Movie Recommendation System with BERT4Rec
31 reads- #Recommendation Systems
- BERT
- +4
ClarityAI: AI Chatbot for Cataract Diagnosis with RAG-Based Knowledge and CNN Image Analysis
- e
9 reads- Artificial Intelligence
- Chatbot
- +6
PyVisionAI: Agentic AI for Intelligent Document Processing and Visual Understanding
- r
9 reads- Agentic AI
- Apache License 2.0
- +40
Open_Recall
3 reads- DigitalMemory
- KnowledgeManagement
- +1
IST-ROS: A Flexible Object Segmentation and Tracking Framework for ROS
- k
21 reads- interactive-segmentation
- robotics
- +3
Gemini Powered Vision App ( Image Analyzer)
- a
3 reads- AI
- Chat with Image
- +7
RAGAS: A Framework for Evaluation of LLMs Systems
15 reads- Agents
- AI
- +2
SentinelFlow - AI-Powered Security Agent
- s
5 reads- agentic ai
- ai
- +5
Segmentation and classification of SAR images using Unet, Unet++, ViT and ResNet
- g
22 reads- AI
- CV
- +4
Route-Planning-for-the-Visually-Impaired-Person
22 reads- DeepLabV3
- image segmentation
- +4
Text to Image Synthesis using DC-GAN
- s
- a
21 reads- Computer Vision
- DC-GAN
- +2