Dr Suzy J Styles

837 Followers
184 Following
250 Posts
Developmental psycholinguist: babies, brains, multilingualism, multisensory perception, occasionally apes. BLIP Lab in Psychology at NTUsg in Singapore.
ORCiD: https://orcid.org/0000-0003-3517-9680
Okay folks we are deep in the semester with an Ape’s Guide to Human Language (my most bonkers interdisciplinary elective), and guess who bumped into one of the local primates on campus 😍

It’s official - I’ve made tenure! I’m now an Associate Professor in Psychology at Nanyang Technological University Singapore!🎉

This wouldn’t have been possible without my awesome mentors, my academic support sisters 💪💪💪, the wonderful wonderful junior collaborators in my lab - past and present - and of course, my anonymous letter writers 🙏

Baseline system + leaderboards are up for #MerlionChallenge untangling complex code-mixed speech. Which #ML #DeepLearning #SpeechProc system will do the best job on complex language use in the wild? 👀

TWO TEAMS have already beaten the baseline for Language ID:
🎉Lingua_Lumos (Closed)
🎉UNSW_Signal_Processing (Open)

There’s still time to join the challenge and prep your paper for our special session at #Interspeech2023

https://toot.community/@suzyjstyles/109793982337809507

Dr Suzy J Styles (@[email protected])

Have you ever seen auto-generated subtitles turn to mush because they couldn’t handle a speaker’s accent or figure out what language they’re speaking after a switch? The #MerlionChallenge for #Interspeech23 tests how well teams can build a language detection system for Code-Switching in >300 Zoom recordings. Help build robust systems for multilingualism by joining the challenge or sharing with #ML #DeepLearning #SpeechProc friends 💪🏼💪🏽💪🏿 https://toot.community/@suzyjstyles/109713790862725145

toot.community

We provide:
👉Training data (closed track)
👉Dev audio + ground truth
👉Eval audio w/ reduced annotations
👉CodaLab submission and scoring
👉FOUR live Leaderboards
👉Leaderboard Chat 😏

You can add:
👉Up to 100hr training data (open track)

For the #MerlionChallenge at #Interspeech we’ll be asking teams to train a #SpeechProc / #AI system that can guess which language is which (Task 1: Language ID) and when (Task 2: Language Diarization)!

👉Challenge audio is Zoom recordings with English and/or Mandarin Chinese
👉Audio for development matches audio for evaluation 😗👌

Our annotation protocol is documented in the BELA transcription conventions. The Wiki includes instructions for how to do multi-tier multilingual transcriptions using Elan (free!)

BELA Con:
blipntu.github.io/belacon/

For the #MerlionChallenge we hold some info back

I’m sure I have a bunch of #Multilingual #LangDev, #SpeechProc #NLP and #CogSci friends over here 🦣

We’ve prepped >30hrs of our English/Mandarin code-switched child directed speech for the #MerlionChallenge at this year’s INTERSPEECH
>300 files, >100 voices 🙀 (+ training data)

We’re looking for speech systems that can figure out which language is spoken when!

The #MerlionChallenge will see whose system does the best job 💪🏼

Join or help us boost the message: https://sites.google.com/view/merlion-ccs-challenge/

MERLIon CCS Interspeech 2023

About

It’s all happening folks!
The #EinsteinFoundationAwards are being presented right now! This unique award recognises #OpenScience #Transparency #Reproducibility and radical initiatives in #QualityAssurance

Swapped 29°C Singapore for 9°C Berlin (That’s 84F and 48F for you Americanists 😅) Same humidity though - 89%!!!

Blue skies and light fog. CRISP 🥶