Output modalities (e.g. ['audio'] or ['text']). Defaults to ['audio']. The server accepts only one modality at a time, so the list should contain exactly one entry.
['audio']
['text']
final List<String>? outputModalities;
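The single-modality constraint can be checked on the client before a request is sent. The following is a minimal sketch, not part of the documented API: `resolveOutputModalities` is a hypothetical helper name, and it assumes that a null value should fall back to the documented default of ['audio'].

```dart
/// Hypothetical helper (an assumption, not part of the original API):
/// applies the documented default and enforces the one-modality rule.
List<String> resolveOutputModalities(List<String>? outputModalities) {
  // A null value falls back to the documented default of ['audio'].
  final modalities = outputModalities ?? const ['audio'];

  // The server only accepts a single modality at a time.
  if (modalities.length != 1) {
    throw ArgumentError.value(
      outputModalities,
      'outputModalities',
      'exactly one modality is accepted at a time',
    );
  }
  return modalities;
}

void main() {
  print(resolveOutputModalities(null)); // [audio]
  print(resolveOutputModalities(['text'])); // [text]
}
```

The helper deliberately does not check the modality names themselves, since the description gives 'audio' and 'text' only as examples. Passing a list such as ['audio', 'text'] throws an ArgumentError locally rather than waiting for the server to reject it.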