Jürgen Schmidhuber

From Infogalactic: the planetary knowledge core
Jump to: navigation, search
Jürgen Schmidhuber
File:Jürgen Schmidhuber.jpg
Schmidhuber speaking at the AI for GOOD Global Summit in 2017
Born 17 January 1963[1]
Munich,[1] West Germany
Nationality German
Fields Artificial intelligence
Institutions Dalle Molle Institute for Artificial Intelligence Research
Alma mater Technical University of Munich
Known for Artificial intelligence, deep learning, artificial neural networks, recurrent neural networks, Gödel machine, artificial curiosity, meta-learning
Website
people.idsia.ch/~juergen

Jürgen Schmidhuber (born 17 January 1963)[1] is a computer scientist most noted for his work in the field of artificial intelligence, deep learning and artificial neural networks. He is a co-director of the Dalle Molle Institute for Artificial Intelligence Research in Lugano, in Ticino in southern Switzerland.[2]:{{{3}}} Following Google Scholar, from 2016 to 2021 he has received more than 100,000 scientific citations.[3] He has been referred to as "father of modern AI,"[4][5][6][7][8][9][10] "father of AI,"[11][12]:{{{3}}}[13] "dad of mature AI,"[2]:{{{3}}} "Papa" of famous AI products,[14]:{{{3}}} "Godfather,"[15][7] and "father of deep learning."[16][7] (Schmidhuber himself, however, has called Alexey Grigorevich Ivakhnenko the "father of deep learning."[17])

Schmidhuber did his undergraduate studies at the Technical University of Munich in Munich, Germany.[1] He taught there from 2004 until 2009 when he became a professor of artificial intelligence at the Università della Svizzera Italiana in Lugano, Switzerland.[18]:{{{3}}}

Work

With his students Sepp Hochreiter, Felix Gers, Fred Cummins, Alex Graves, and others, Schmidhuber published increasingly sophisticated versions of a type of recurrent neural network called the long short-term memory (LSTM). First results were already reported in Hochreiter's diploma thesis (1991) which analyzed and overcame the famous vanishing gradient problem.[19] The name LSTM was introduced in a tech report (1995) leading to the most cited LSTM publication (1997).[20]

The standard LSTM architecture which is used in almost all current applications was introduced in 2000.[21] Today's "vanilla LSTM" using backpropagation through time was published in 2005,[22][23] and its connectionist temporal classification (CTC) training algorithm[24] in 2006. CTC enabled end-to-end speech recognition with LSTM. In 2015, LSTM trained by CTC was used in a new implementation of speech recognition in Google's software for smartphones.[2]:{{{3}}} Google also used LSTM for the smart assistant Allo[25] and for Google Translate.[26][27] Apple used LSTM for the "Quicktype" function on the iPhone[28][29] and for Siri.[30] Amazon used LSTM for Amazon Alexa.[31] In 2017, Facebook performed some 4.5 billion automatic translations every day using LSTM networks.[32] Bloomberg Business Week wrote: "These powers make LSTM arguably the most commercial AI achievement, used for everything from predicting diseases to composing music."[15]

In 2011, Schmidhuber's team at IDSIA with his postdoc Dan Ciresan also achieved dramatic speedups of convolutional neural networks (CNNs) on fast parallel computers called GPUs. An earlier CNN on GPU by Chellapilla et al. (2006) was 4 times faster than an equivalent implementation on CPU.[33] The deep CNN of Dan Ciresan et al. (2011) at IDSIA was already 60 times faster[34] and achieved the first superhuman performance in a computer vision contest in August 2011.[35] Between 15 May 2011 and 10 September 2012, their fast and deep CNNs won no fewer than four image competitions.[36][37] They also significantly improved on the best performance in the literature for multiple image databases.[38] The approach has become central to the field of computer vision.[37] It is based on CNN designs introduced much earlier by Yann LeCun et al. (1989)[39] who applied the backpropagation algorithm to a variant of Kunihiko Fukushima's original CNN architecture called neocognitron,[40] later modified by J. Weng's method called max-pooling.[41][37]

In 2014, Schmidhuber formed a company, Nnaisense, to work on commercial applications of artificial intelligence in fields such as finance, heavy industry and self-driving cars. Sepp Hochreiter, Jaan Tallinn, and Marcus Hutter are advisers to the company.[2]:{{{3}}} Sales were under US$11 million in 2016; however, Schmidhuber states that the current emphasis is on research and not revenue. Nnaisense raised its first round of capital funding in January 2017. Schmidhuber's overall goal is to create an all-purpose AI by training a single AI in sequence on a variety of narrow tasks.[42]

Views

According to The Guardian,[43] Schmidhuber complained in a "scathing 2015 article" that fellow deep learning researchers Geoffrey Hinton, Yann LeCun and Yoshua Bengio "heavily cite each other," but "fail to credit the pioneers of the field", allegedly understating the contributions of Schmidhuber and other early machine learning pioneers including Alexey Grigorevich Ivakhnenko who published the first deep learning networks already in 1965. LeCun denied the charge, stating instead that Schmidhuber "keeps claiming credit he doesn't deserve".[2]:{{{3}}}[43] Schmidhuber replied that LeCun did not provide a single example for his statement, and listed several priority disputes.[44]

Recognition

Schmidhuber received the Helmholtz Award of the International Neural Network Society in 2013,[45]:{{{3}}} and the Neural Networks Pioneer Award of the IEEE Computational Intelligence Society in 2016[46]:{{{3}}} for "pioneering contributions to deep learning and neural networks."[1] He is a member of the European Academy of Sciences and Arts.[47]:{{{3}}}[18]:{{{3}}}

References

  1. 1.0 1.1 1.2 1.3 1.4 [1]
  2. 2.0 2.1 2.2 2.3 2.4 John Markoff (27 November 2016). When A.I. Matures, It May Call Jürgen Schmidhuber ‘Dad’. The New York Times. Accessed April 2017.
  3. Lua error in package.lua at line 80: module 'strict' not found.
  4. Lua error in package.lua at line 80: module 'strict' not found.
  5. Lua error in package.lua at line 80: module 'strict' not found.
  6. Lua error in package.lua at line 80: module 'strict' not found.
  7. 7.0 7.1 7.2 Lua error in package.lua at line 80: module 'strict' not found.
  8. Lua error in package.lua at line 80: module 'strict' not found.
  9. Lua error in package.lua at line 80: module 'strict' not found.
  10. Lua error in package.lua at line 80: module 'strict' not found.
  11. Lua error in package.lua at line 80: module 'strict' not found.
  12. Telekom (21 April 2017). Video-Interview mit Prof. Jürgen Schmidhuber, oft als Vater der Künstlichen Intelligenz bezeichnet (often called the father of AI). Telekom. Accessed August 2021.
  13. Ruth Fulterer (21 February 2021). Der unbequeme Vater der künstlichen Intelligenz lebt in der Schweiz (The inconvenient father of AI lives in Switzerland). NZZ. Accessed August 2021.
  14. Enrique Alpanes (25 April 2021). Jürgen Schmidhuber, el hombre al que Alexa y Siri llamarían ‘papá’ si él quisiera hablar con ellas. El Pais. Accessed August 2021.
  15. 15.0 15.1 Lua error in package.lua at line 80: module 'strict' not found.
  16. Lua error in package.lua at line 80: module 'strict' not found.
  17. Lua error in package.lua at line 80: module 'strict' not found.
  18. 18.0 18.1 Dave O'Leary (3 October 2016). The Present and Future of AI and Deep Learning Featuring Professor Jürgen Schmidhuber. IT World Canada. Accessed April 2017.
  19. Lua error in package.lua at line 80: module 'strict' not found.
  20. Lua error in package.lua at line 80: module 'strict' not found.
  21. Lua error in package.lua at line 80: module 'strict' not found.
  22. Lua error in package.lua at line 80: module 'strict' not found.
  23. Lua error in package.lua at line 80: module 'strict' not found.
  24. Lua error in package.lua at line 80: module 'strict' not found.
  25. Lua error in package.lua at line 80: module 'strict' not found.
  26. Lua error in package.lua at line 80: module 'strict' not found.
  27. Lua error in package.lua at line 80: module 'strict' not found.
  28. Lua error in package.lua at line 80: module 'strict' not found.
  29. Lua error in package.lua at line 80: module 'strict' not found.
  30. Lua error in package.lua at line 80: module 'strict' not found.
  31. Lua error in package.lua at line 80: module 'strict' not found.
  32. Lua error in package.lua at line 80: module 'strict' not found.
  33. Lua error in package.lua at line 80: module 'strict' not found.
  34. Lua error in package.lua at line 80: module 'strict' not found.
  35. Lua error in package.lua at line 80: module 'strict' not found.
  36. Lua error in package.lua at line 80: module 'strict' not found.
  37. 37.0 37.1 37.2 Lua error in package.lua at line 80: module 'strict' not found.
  38. Lua error in package.lua at line 80: module 'strict' not found.
  39. Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, Backpropagation Applied to Handwritten Zip Code Recognition; AT&T Bell Laboratories
  40. Lua error in package.lua at line 80: module 'strict' not found.
  41. Lua error in package.lua at line 80: module 'strict' not found.
  42. Lua error in package.lua at line 80: module 'strict' not found.
  43. 43.0 43.1 Lua error in package.lua at line 80: module 'strict' not found.
  44. Lua error in package.lua at line 80: module 'strict' not found.
  45. INNS Awards Recipients. International Neural Network Society. Accessed December 2016.
  46. Recipients: Neural Networks Pioneer Award. Piscataway, NJ: IEEE Computational Intelligence Society. Accessed January 2019.
  47. Members. European Academy of Sciences and Arts. Accessed December 2016.