This Month’s Focus: Toolkit Creation & Enhanced Language Compatibility

This month’s initiative centers on developing versatile linguistic tools and expanding our core Speech-to-Text (STT) capabilities. Our primary focus is the Mahina Toolkit (mahina-tk), a modular set of language utilities designed to work in both live speech and static text environments.

Mahina Toolkit (mahina-tk)

Modular Design:

Merges existing Core Tools into a portable, modular web app.
Supports both live speech (real-time STT) and static text analysis.

Integrated Platform Development:

Building Mahina HUD visionOS app as a central platform for Natural Language Processing (NLP) toolsets and AR language overlay tech.
Experimental integration of mixed reality features to enhance user interaction.

Additional Developments:

Deployment of advanced handwriting and text recognition tools, including OCR functionality.
Handwriting tools: implementation of the deepseek-vl-1.3b model for character recognition.

Availability:

mahina-tk API will be available for commercial licensing, with free access offered to educational institutions.

Toolkit Use Cases

Use Case 1: Real-Time Speech-to-Text (STT) Processing

- Feature:
  - Transcribes live speech with transient word display, POS tagging, and integrated language tools.

Use Case 2: Static Text Analysis

- Feature:
  - Analyzes longer, static text with POS tagging and integrated language tools.
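
To illustrate how one analysis routine could serve both use cases, here is a minimal Python sketch. It is not mahina-tk code: the names TOY_LEXICON, TaggedToken, tag_text, and tag_stream are hypothetical, and the tiny lookup table stands in for a real POS tagger.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List

# Hypothetical toy lexicon; a real tagger would use a trained model.
TOY_LEXICON = {
    "the": "DET", "a": "DET",
    "cat": "NOUN", "dog": "NOUN",
    "sees": "VERB", "runs": "VERB",
    "small": "ADJ",
}


@dataclass(frozen=True)
class TaggedToken:
    text: str
    pos: str


def tag_text(text: str) -> List[TaggedToken]:
    """Static text: tag whitespace-separated words; unknown words get 'X'."""
    tokens = []
    for raw in text.split():
        word = raw.strip(".,!?;:")  # drop surrounding punctuation
        if word:
            tokens.append(TaggedToken(word, TOY_LEXICON.get(word.lower(), "X")))
    return tokens


def tag_stream(chunks: Iterable[str]) -> Iterator[List[TaggedToken]]:
    """Live speech: tag each partial transcript as it arrives.

    The chunks iterable stands in for the output of a real-time STT engine.
    """
    for chunk in chunks:
        yield tag_text(chunk)
```

In this sketch the static path calls tag_text once on the whole document, while the live path feeds transcript chunks through tag_stream, so both modes share the same tagging logic.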

STT Multisub Support & Compatibility Updates

Transliteration – Romanization:

- Expanded language support now includes Thai, Lao, Croatian, Greek, Lithuanian, Swedish, Arabic and more.
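
As a rough picture of what romanization involves, the Python sketch below maps Greek letters to Latin ones with a lookup table. The table is simplified and illustrative rather than a complete or standard scheme, and the names GREEK_TO_LATIN, strip_accents, and romanize_greek are hypothetical, not part of the toolkit.

```python
import unicodedata

# Simplified, illustrative Greek-to-Latin table (an assumption for this
# sketch, not a complete or standard romanization scheme).
GREEK_TO_LATIN = {
    "α": "a", "β": "v", "γ": "g", "δ": "d", "ε": "e", "ζ": "z",
    "η": "i", "θ": "th", "ι": "i", "κ": "k", "λ": "l", "μ": "m",
    "ν": "n", "ξ": "x", "ο": "o", "π": "p", "ρ": "r", "σ": "s",
    "ς": "s", "τ": "t", "υ": "y", "φ": "f", "χ": "ch", "ψ": "ps",
    "ω": "o",
}


def strip_accents(text: str) -> str:
    """Decompose characters and drop combining marks such as the tonos."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def romanize_greek(text: str) -> str:
    """Replace Greek letters with Latin equivalents, keeping capitalization.

    Characters missing from the table pass through (minus any accents).
    """
    out = []
    for ch in strip_accents(text):
        latin = GREEK_TO_LATIN.get(ch.lower())
        if latin is None:
            out.append(ch)
        elif ch.isupper():
            out.append(latin.capitalize())
        else:
            out.append(latin)
    return "".join(out)
```

For example, romanize_greek("Καλημέρα") yields "Kalimera". Scripts such as Thai or Arabic need more than a character table (vowel ordering, unwritten vowels), so this approach covers only the simplest cases.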

Transliteration Mode:
- Chinese (zh-TW, zh-CN): added partial pinyin support.

Ongoing & Future Developments

PencilKit Integration:

- Currently under construction; integrates OCR tools with open-source language models such as deepseek-vl-1.3b-chat.

Visual Environment Augmentation (VEA):
- In the early exploration phase with a focus on hardware compatibility.
- 3D mapping and other environment augmentation tools are being explored.

Stay updated here:

GitHub: github.com/phasatek

Bluesky: bsky.app/profile/phasatek.bsky.social

X: x.com/phasatek