Drag

Generative AI-Powered Chat Assistant for Autistic Children (Healthcare)

This project aimed to develop a generative AI-powered chat assistant tailored to support autistic children in their social and communication development. The assistant was designed to provide personalized, empathetic interactions while offering parents and therapists actionable insights into a child’s progress. Using cutting-edge generative AI technologies and behavioral psychology frameworks, the solution empowered children to practice social skills, reduce anxiety, and enhance overall well-being.

Challenges

  1. Understanding Communication Needs:
    • Autism Spectrum Disorder (ASD) involves diverse communication styles, requiring the assistant to adapt dynamically to each child’s unique needs.
  2. Building Empathy and Trust:
    • Ensuring the AI responded in a supportive, non-judgmental, and engaging manner to foster trust and comfort.
  3. Personalization at Scale:
    • Customizing conversations to cater to individual developmental goals and communication preferences.
  4. Ensuring Safety and Compliance:
    • Adhering to healthcare regulations like HIPAA and ensuring data privacy and security.
  5. Multimodal Interaction:
    • Integrating text, voice, and visual aids to make the interaction engaging and accessible for children with varied needs.

Our Solutions

The Generative AI-powered chat assistant was built using a combination of AI technologies, healthcare expertise, and behavioral psychology principles:

  1. Generative AI Core:
    • Used OpenAI’s GPT models fine-tuned with datasets specific to ASD communication patterns.
    • Included training data on child psychology and therapeutic approaches to generate empathetic and context-aware responses.
  2. Personalization Engine:
    • Integrated machine learning algorithms to learn from individual interaction histories, adapting tone, vocabulary, and difficulty levels.
    • Enabled customizable goals for social and emotional learning, such as maintaining eye contact, initiating conversations, or managing sensory overload.
  3. Multimodal Capabilities:
    • Designed a visual interface with images, emojis, and videos to complement text and voice interactions.
    • Integrated speech-to-text and text-to-speech capabilities for children with varying communication preferences.
  4. Parental and Therapist Dashboard:
    • Provided real-time insights into a child’s progress, such as their comfort level in different scenarios or their improvement in conversational skills.
    • Offered recommendations for personalized therapy sessions and home-based activities.
  5. Data Privacy and Security:
    • Encrypted all data storage and communication.
    • Ensured compliance with HIPAA and other healthcare regulations to protect sensitive information.

Flask

Docker

Python

Scrapy

HuggingFace

Lighttag

AWS EC2

Selenium

Impacts

  1. Data Crawling and Extraction:
    • Scraped data from multiple domains using Scrapy and Selenium.
    • Parsed and structured data with Beautiful Soup for further processing.
  2. NLP Processing:
    • Identified key entities such as artist and organization names using Spacy NER.
    • Summarized data and extracted keywords for content categorization.
    • Used topic modeling to classify data into relevant categories.
  3. HTML Update Tracking:
    • Compared newly scraped data with previously stored HTML content.
    • Detected and classified updates such as text additions or removals using clustering logic.
    • Logged all updates for downstream analysis.
  4. Face Cropping API:
    • Developed a Pytorch-based MTCNN API to detect and crop artist faces from images.
    • Automated the integration of image processing with the scraping pipeline.
  5. Deployment and Storage:
    • Hosted the pipeline on AWS EC2 for high availability and scalability.
    • Stored processed data in MongoDB and DynamoDB for efficient query and retrieval.
  6. APIs and Integration:
    • Built Flask APIs to provide endpoints for data access, HTML update detection, and face cropping services.

Benefits

  1. Comprehensive Data Processing:
    • Delivered structured data with key insights like topics, keywords, and named entities.
  2. Efficient Update Detection:
    • Automated detection of content changes in HTML pages saved time and resources.
  3. Enhanced Face Detection:
    • High-quality face cropping for individuals and groups enhanced downstream tasks like profile creation.
  4. Scalable Infrastructure:
    • Dockerized and AWS-hosted pipeline ensured seamless scaling and deployment.
  5. Streamlined Access:
    • Flask APIs allowed easy integration of data scraping and processing capabilities with other systems.

Future Scope

  1. Real-Time Update Monitoring:
    • Implement real-time crawling and update detection for dynamic websites.
  2. Advanced AI Models:
    • Use transformer-based models like BERT for better summarization and topic modeling.
  3. Dashboard Integration:
    • Develop an interactive dashboard for visualizing scraped data, HTML updates, and processed results.
  4. Cross-Domain Adaptation:
    • Extend the pipeline to support data scraping and processing for additional domains and use cases.
  5. Multilingual NLP Support:
    • Incorporate multilingual models to handle data in diverse languages.

Conclusion

This AI Data Engineering Pipeline demonstrated the power of combining web scraping, NLP, and AI for structured data extraction and processing. The use of advanced clustering techniques for HTML update detection and a Pytorch-based face cropping API added unique value to the solution. With its scalable deployment on AWS and integration with MongoDB and DynamoDB, the project provided a robust and reliable platform for large-scale data engineering tasks.