Skip to content
View CreativeSelf0's full-sized avatar
🚀
🚀

Block or report CreativeSelf0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
CreativeSelf0/README.md

About me

Hello, I'm Abulmajeed 👋

I'm a Master’s student in the Master of Language Technologies (MLT) program at Carnegie Mellon University. I come from an applied machine learning background, and my current interests focus on multimodal learning, spoken language modeling, and the intersection of language, audio, and vision.

I am particularly interested in how modern foundation models enable more natural human–computer interaction, as well as how multimodal systems can reason over heterogeneous signals. Outside of coursework and research, I enjoy experimenting with new tools, frameworks, and model architectures.


General

  • Blogger 📖 | I occasionally write about machine learning, systems, and topics I find interesting.
  • Experimentalist 🚀 | I build and share small experimental projects when time permits.
  • Mint Tea Connoisseur 🍵 | Still convinced good mint tea is an underrated skill.

Pinned Loading

  1. TranscriptToPDF TranscriptToPDF Public

    This project aims to convert YouTube transcripts into readable PDF documents with time stamps. The purpose is to provide a convenient way for knowledge seekers to follow along and absorb valuable i…

    4 2

  2. PH-KDX/flightplandb-py PH-KDX/flightplandb-py Public

    Python wrapper for the Flight Plan Database API

    Python 17 4

  3. abjadai/catt abjadai/catt Public

    The official implementation of CATT Arabic diacritization models.

    Python 58 9

  4. NVIDIA-NeMo/NeMo NVIDIA-NeMo/NeMo Public

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    Python 16.6k 3.3k