Reminder that I administer the "some NLG people" Discord server for researchers and others interested in #NaturalLanguageGeneration / #TextGeneration. DM me if you want an invite link :)
Less than two weeks left until the start of #INLG2025! Who's coming to Vietnam to see the latest research in #NaturalLanguageGeneration?

The registration page for #INLG2025 is now live! Join us in Vietnam at the end of October / start of November for the best conference on #NaturalLanguageGeneration

https://2025.inlgmeeting.org/registration.html

Curious to see what will be presented? Check out this list of accepted papers! https://2025.inlgmeeting.org/accepted-papers.html

Countdown clock for the #INLG2025 direct submission deadline: https://www.timeanddate.com/countdown/generic?iso=20250718T235959&p0=3399&msg=INLG+2025+Submission+Deadline+%28direct+submissions%29&font=sanserif

Submit to the longest running conference focused on #NaturalLanguageGeneration :D

https://2025.inlgmeeting.org

(ARR commitment deadline is August 7th)

INLG 2025 Submission Deadline (direct submissions)

Countdown to 18 Jul 2025 23:59:59. Showing days, hours, minutes and seconds ticking down to 0

#SASSY

This project is going to be very much exploratory programming. I have no idea if this is going to work. It will be fun.

The program will need something to chew on. I think "example data" would be a better term than "test data". Testing is the process of confirming a program coforms to its design. At this stage we have no design.

I have chosen five paragraphs from various design documents. They include a paragraph from a preliminary design, a functional requirement, a use case, an architectural tactic and a component description.

The input to the process is an RDF model that represents the information in the paragraph. This is intended to simulate an extract from what will be in SASSY's knowledge database.

I am currently constucting these models.

#SoftwareArchitecture
#NLG #NaturalLanguageGeneration

#SASSY

There is one classification of the RST reationships that we can use. They can be either structural or semantic. The structural relationships will come from the overall design of the document. The semantic ones come from the data properties - i.e. the predicates of the RDF statements.

In practice this means that the sorted order for the RDF statements within a paragraph is based on the semantics.

This SASSY subproject aims to take lists of RDF statements and sort each list based on the semantic ranks. The RDF statements will then be agregated into sentence sized chunks.

The next step is the internal organisation of the sentences, such as adjectival phrases and subordinate clauses.

An interesting step is to identify the flow of focus and use that to replace nouns (Fred) with pronouns (he) or noun phrases (the boy). For a design document it is especially important not to introduce any ambiguity.

The final step is to create a grammar tree for each sentence and pass it to the NLG program.

#SoftwareArchitecture
#NLG #NaturalLanguageGeneration

#SASSY

The core observation of Rhetorical Structure Theory (RST) is that text can be recursively subdivided into two parts, termed the "nucleus" and the "satellite". The nucleus can be understood by itself, whereas the satellite depends on the nucleus.

RST researchers spent a lot of time trying to classify the relationships between the nucleus and satellite. Not much agreement was reached.

Their approach to text generation was to select a communicative goal which would indicate a general plan. The plan would suggest relationships and the database would then be searched for matching data.

For SASSY the communicative goal is simply "tell us everything you know". The planning then collapses to a sorting process.

While playing around with ways to sort the data it became apparent that the actual set of RST relationships was not the important aspect. What was driving the algorithm was the ranking of the relationships. This is why the researchers could never agree on what relationships should be used - it doesn't matter.

For an RDF knowledge database I think this means that I just need to assign a "semantic rank" to each property. We shall see ...

#SoftwareArchitecture
#NLG #NaturalLanguageGeneration

#SASSY

Natural Language Generation has gone through three major phases. Initially templates were used. This evolved into Rhetorical Structure Theory which tried to use planning to deduce the structure of the text. More recently Large Language Models have got the attention of the NLG community.

Templates do not have the flexibility to handle arbitrary software designs. Any novel attribute of a design would likely be overlooked by the templates.

LLMs appear to be non-deterministic. This is a problem for design documents. They need to be able to be regenerated at any time and the only changes should be a direct result of changes in the data.

A lot of work was done on RST in the 80's and 90's. The fact that there is not any readily available RST based software available today is a good clue that it was not a successful endevour.

I found two major issues with RST: Firstly the planning approach could leave important information unreported. This is a show stopper for SASSY. Secondly, in order to get good results it took months long collaboration between linguists and subject matter experts. This would also be an obsticle for SASSY that aims to be an easy to use tool for helping with software design.

Next I will discuss my plan to adapt RST for use in SASSY.

#SoftwareArchitecture
#NLG #NaturalLanguageGeneration

It's been a while since I wrote an #introduction for folks who are #NewHere.

I'm a #queer #Catholic computational #linguist who works on #NaturalLanguageGeneration and lives in #Scotland. An #immigrant multiple times over, I was born in #SouthAfrica, grew up in the USA (in #Florida), did my PhD in #Germany, and moved here in 2019. I enjoy #VideoGames, #VideoEssays, #OpenMic nights, and #running. I happen to be #nonbinary and love #languages.

Facilitating Team Communication by AI Prompting Humans! Introducing Daily Team Sync

https://rajiv.com/blog/2024/07/21/facilitating-team-communication-by-ai-prompting-humans-introducing-daily-team-sync/

Facilitating Team Communication by AI Prompting Humans! Introducing Daily Team Sync - rajiv.com

In the ever-evolving landscape of remote work, maintaining team cohesion and productivity can be a challenge. As someone who’s led distributed teams across various media and technology companies, I’ve often pondered how we might leverage AI to enhance our daily interactions without losing the human touch. This reflection led me to create Daily Team Sync, […]

rajiv.com