default search action

combined dblp search
author search
venue search
publication search

ask others

Tom Bagby

> Home > Persons

Person information

SPARQL queries

🛈 Please note that only 65% of the items listed on this page have a DOI stored with their dblp record. Therefore, DOI-based queries can only provide partial results.

run query for this person

or build your own?

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-07143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-07143
Georg Heigold, Ehsan Variani, Tom Bagby, Cyril Allauzen, Ji Ma, Shankar Kumar, Michael Riley:
Massive Sound Embedding Benchmark (MSEB). CoRR abs/2602.07143 (2026)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2605-04556
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2605-04556
Cyril Allauzen, Tom Bagby, Georg Heigold, Ehsan Variani, Ke Wu:
Benchmarking LLMs on the Massive Sound Embedding Benchmark (MSEB). CoRR abs/2605.04556 (2026)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2605-27295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2605-27295
Madhuri Shanbhogue, Zhe Li, Shanfeng Zhang, Gustavo Hernández Ábrego, Shih-Cheng Huang, Aashi Jain, Daniel Salz, Sonam Goenka, Chaitra Hegde, Ji Ma, Feiyang Chen, Jiaxing Wu, Tanmaya Dabral, Babak Samari, Kevin Poulet, Daniel Cer, Kaifeng Chen, Paul Suganathan, Hui Hui, Jovan Andonov, Philippe Schlattner, Jay Han, Iftekhar Naim, Wing Lowe, Vladimir Pchelin, Albert Yang, Yi-Ting Chen, Zhongli Ding, Grace Zhang, Georg Heigold, Yichang Chen, Antoine Reveillon, Brendan Mccloskey, Wenlei Zhou, Dahun Kim, Rui Meng, Emma Wang, Jack Zheng, Halley Fede, Zhen Yang, Keegan Mosley, Brian Potetz, Sahil Dua, Henrique Schechter Vera, Shen Gao, Hesen Zhang, Andreas Hess, Hengxuan Ying, Alberto Montes, Karan Gill, Min Choi, Sebastian Russo, Anja Hauth, Jinhyuk Lee, Michael Boratko, Megan Barnes, Vikram Rao, Claudiu Musat, Cyril Allauzen, Ehsan Variani, Shankar Kumar, Tom Bagby, Junyi Jiao, Yang Gu, Tengxin Li, Ayush Agrawal, Roberto Santana, Dev Nath, Stephen Karukas, Shuoxuan Han, Lucia Loher, Alice Twu, Nidhi Vyas, Siddharth Bhai, Frank Palma Gomez, Wangyuan Zhang, Chaoren Liu, Jizheng Yang, Steve Qiu, Shijie Zhang, Sujay Kulkarni, Sascha Rothe, Sean Nakamoto, Raphael Hoffmann, Zach Gleicher, Yun-Hsuan Sung, Qin Yin, Tom Duerig, Mojtaba Seyedhosseini:
Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini. CoRR abs/2605.27295 (2026)
2025
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-23292
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-23292
R. J. Skerry-Ryan, Julian Salazar, Soroosh Mariooryad, David Kao, Daisy Stanton, Eric Battenberg, Matt Shannon, Ron J. Weiss, Robin Scheibler, Jonas Rothfuss, Tom Bagby:
SequenceLayers: Sequence Processing and Streaming Neural Networks Made Easy. CoRR abs/2507.23292 (2025)
2023
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuVBR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuVBR23
Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley:
Last: Scalable Lattice-Based Speech Modelling in Jax. ICASSP 2023: 1-5
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-13134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-13134
Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley:
LAST: Scalable Lattice-Based Speech Modelling in JAX. CoRR abs/2304.13134 (2023)
2022
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/StantonSMSBBK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/StantonSMSBBK22
Daisy Stanton, Matt Shannon, Soroosh Mariooryad, R. J. Skerry-Ryan, Eric Battenberg, Tom Bagby, David Kao:
Speaker Generation. ICASSP 2022: 7897-7901
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-03232
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-03232
Soroosh Mariooryad, Matt Shannon, Siyuan Ma, Tom Bagby, David Kao, Daisy Stanton, Eric Battenberg, R. J. Skerry-Ryan:
Learning the joint distribution of two sequences using little or no paired data. CoRR abs/2212.03232 (2022)
2021
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-05095
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-05095
Daisy Stanton, Matt Shannon, Soroosh Mariooryad, R. J. Skerry-Ryan, Eric Battenberg, Tom Bagby, David Kao:
Speaker Generation. CoRR abs/2111.05095 (2021)
2020
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BattenbergSMSKS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BattenbergSMSKS20
Eric Battenberg, R. J. Skerry-Ryan, Soroosh Mariooryad, Daisy Stanton, David Kao, Matt Shannon, Tom Bagby:
Location-Relative Attention Mechanisms for Robust Long-Form Speech Synthesis. ICASSP 2020: 6194-6198
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HabibMSBSSKB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HabibMSBSSKB20
Raza Habib, Soroosh Mariooryad, Matt Shannon, Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, David Kao, Tom Bagby:
Semi-Supervised Generative Modeling for Controllable Speech Synthesis. ICLR 2020
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-08029
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-08029
Matt Shannon, Ben Poole, Soroosh Mariooryad, Tom Bagby, Eric Battenberg, David Kao, Daisy Stanton, R. J. Skerry-Ryan:
Non-saturating GAN training as divergence minimization. CoRR abs/2010.08029 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeSPMAZRKWPLBSL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeSPMAZRKWPLBSL19
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-02246
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-02246
Izhak Shafran, Tom Bagby, R. J. Skerry-Ryan:
Complex Evolution Recurrent Neural Networks (ceRNNs). CoRR abs/1906.02246 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-03402
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-03402
Eric Battenberg, Soroosh Mariooryad, Daisy Stanton, R. J. Skerry-Ryan, Matt Shannon, David Kao, Tom Bagby:
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis. CoRR abs/1906.03402 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-01709
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-01709
Raza Habib, Soroosh Mariooryad, Matt Shannon, Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, David Kao, Tom Bagby:
Semi-Supervised Generative Modeling for Controllable Speech Synthesis. CoRR abs/1910.01709 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10288
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10288
Eric Battenberg, R. J. Skerry-Ryan, Soroosh Mariooryad, Daisy Stanton, David Kao, Matt Shannon, Tom Bagby:
Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis. CoRR abs/1910.10288 (2019)
2018
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiBLMB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiBLMB18
Ehsan Variani, Tom Bagby, Kamel Lahouel, Erik McDermott, Michiel Bacchiani:
Sampled Connectionist Temporal Classification. ICASSP 2018: 4959-4963
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShafranBS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShafranBS18
Izhak Shafran, Tom Bagby, R. J. Skerry-Ryan:
Complex Evolution Recurrent Neural Networks (ceRNNs). ICASSP 2018: 5854-5858
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/BagbyRS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/BagbyRS18
Tom Bagby, Kanishka Rao, Khe Chai Sim:
Efficient Implementation of Recurrent Neural Network Transducer in Tensorflow. SLT 2018: 506-512
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-06621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06621
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018)
2017
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SimNBSB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SimNBSB17
Khe Chai Sim, Arun Narayanan, Tom Bagby, Tara N. Sainath, Michiel Bacchiani:
Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow. ASRU 2017: 258-264
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VarianiBMB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VarianiBMB17
Ehsan Variani, Tom Bagby, Erik McDermott, Michiel Bacchiani:
End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow. INTERSPEECH 2017: 1641-1645

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.