Autonomous Learning Agents

Learning on the Job - Continual Learning after Deployment

Interactive AI and Lifelong and Continual Learning Autonomous Learning: Self-initiated Open-world Continual Learning and Adaptation

Second Edition: "Lifelong Machine Learning." by Z. Chen and B. Liu, Morgan & Claypool, August 2018 (1st edition, 2016)

Three new chapters have been added and others have been updated and/or reorganized.
One Chapter is dedicated to Open World Learning
Any AI system (e.g., chatbot and self-driving car) that cannot learn in deployment (e.g., chatting and driving) in the real-world open envoronment is not truly intelligent.

Continual Learning Dialogue Systems - Learning on the Job after Model Deployment. Tutorial @ IJCAI-2021, August 21-26, 2021, Montreal, Canada.
Continual and Interactive Learning after Model Deployment. Invited talk @ Allen Institute for Artificial Intelligence (AI2), June 18, 2021.
Continual Learning Dialogue Systems - Learning after Model Deployment. Invited talk @ ICLR-21 Workshop on Neural Conversational AI, May 7, 2021.
Learning on the Job in the Open World. Invited talk @ Information Sciences Institute, Univesity of Southern California, Sept.11, 2020.
Learning on the Job in the Open World. Invited talk @ ICML-2020 Workshop on Continual Learning, July 17, 2020.
Lifelong Machine Learning Tutorial. Title: lifelong machine learning and computer reading the Web, KDD-2016, August 13-17, 2016, San Francisco, USA.
Lifelong Machine Learning Tutorial, IJCAI-2015, July 25-31, 2015, Buenos Aires, Argentina.

Motivation for Autonomous Learning: It is known that about 70% of our human knowledge comes from “on-the-job” learning. Only about 10% is learned through formal education and the rest 20% is learned through observation of others (imitation). An autonomous AI system must have this capability - its machine learning algorithm must be able to learn autonomously on the job or while working after model deployment. As the real world is too complex and constantly changing, it is impossible to learn everything through offline training using manually labeled data. An autonomous learning agent has to explore and learn by itself in the real world, which is open and constantly change - full of unknowns. AI agents must be able to detect the unknowns and learn them in a self-supervised manner through its interaction with humans, other agents, and the real-world environment. It should not make the closed-world assumption any more. Here are two motiviating examples:

Greeting bot in a hotel: At any point in time, the bot has learned to recognize all existing hotel guests. When it sees an existing guest, it can call him/her by his/her first name and chat (e.g., Hi John, how are you today?). It must also detect any new guests that it has not seen before. On seeing a new guest, it can say hello, ask for his/her name (e.g., "Welcome to our hotel. What is your name, sir?"), take pictures, and learn to recognize the guest. Next time when it sees the new guest again, it can call him/her by his/her first name and chat like an old friend.
Self-driving car: I worked on self-driving car for a year. Once when we were doing a road test, the car suddenly stopped and refused to move. The road was completely clear and we could not see anything wrong. Debugging back in the lab found that there was a small stone on the road detected by a sensor. This made me think: why cannot the car tell us what the problem was in natural language? Why cannot we tell the car to go ahead also in natural language? Our instruction serves a piece of supervisory information to enable the car to learn so that in the future when a simialr situation occurs, it can behave correctly.

Autonomous Learning: Like human on-the-job learning, it studies learning after model deployment or during model application (or testing) - after a good model has been built and deployed in an application. In classic machine learning, once a model is built, it is deployed in an application. During application, the model remains fixed or unchanged. Autonomous learning (or on-the-job learning) investigates continuous learning after model deployment, which involves the following steps

continuously discover new tasks to learn by the agent itself. This is called Open World Learning or Out-of-Distribtuion Detection.
gather "free" training data through the agent’s own active effort via interaction with humans, other agents and the environment.
incrementally learn the new tasks without interrupting the application to become more and more knowledgeable. This is continual learning.

Here we emphesize interaction with humans and the environment to find new tasks and to naturally label training data (see the two motivating examples above).

Interactive AI: This is also called Interactive Self-Supervision. Step 2 is the key for on-the-job learning, i.e., how to find the hidden classes and obtain labeled training data. This must be done through actions initiated by the system itself without interrupting the application. That is, it must learn actively on its own based on its prior knowledge by observing and interaction with the environment and humans to obtain explicit or implicit feedback to serve as supervision. The interaction with humans should be through natural language dialogues. It is not possible for an autonomous intelligent agent to rely solely on massive manual labeled training data to learn passively offline forever.

Publications

TextBook: Zhiyuan Chen and Bing Liu. Lifelong Machine Learning. Morgan & Claypool, 2018 (2nd edition), 2016 (1st edition).

Bing Liu, Sahisnu Mazumder, Eric Robertson, and Scott Grigsby. AI Autonomy: Self-Initiation, Adaptation and Continual Learning. arXiv:2203.08994 [cs.AI], March 17, 2022.
Bing Liu and Sahisnu Mazumder. Lifelong and Continual Learning Dialogue Systems: Learning during Conversation. to appear in Proceedings of AAAI-2021. 2021.
Sahisnu Mazumder, Bing Liu, Shuai Wang, and Sepideh Esmaeilpour. An Application-Independent Approach to Building Task-Oriented Chatbots with Interactive Continual Learning. to appear at NeurIPS-2020 Workshop on Human in the Loop Dialogue Systems (HLDS-2020). 2020.
Sahisnu Mazumder, Bing Liu, Nianzu Ma, Shuai Wang. Continuous and Interactive Factual Knowledge Learning in Verification Dialogues. to appear at NeurIPS-2020 Workshop on Human And Machine in-the-Loop Evaluation and Learning Strategies (HAMLETS-2020). 2020.
Bing Liu and Chuhe Mei. Lifelong Knowledge Learning in Rule-based Dialogue Systems. arXiv:2011.09811 [cs.AI], 2020.
Bing Liu. Learning on the Job: Online Lifelong and Continual Learning. Proceedings of 34th AAAI Conference on Artifical Intelligence (AAAI-2020), Feb 7-12, 2020, New York City. (This work was done while I was on leave in Peking University).
Sahisnu Mazumder, Bing Liu, ShuaiWang, Nianzu Ma. Lifelong and Interactive Learning of Factual Knowledge in Dialogues. to appear in Proceedings of Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL-2019), 11-13 September 2019, Stockholm, Sweden.
Hu Xu, Bing Liu, Lei Shu and P. Yu. Open-world Learning and Application to Product Classification. to appear in Proceedings of the Web Conference (formerly known as the WWW conference), San Francisco, May 13-17, 2019.
Lei Shu, Hu Xu, Bing Liu. Unseen Class Discovery in Open-world Classification. arXiv:1801.05609 [cs.LG], 2018.
Lei Shu, Hu Xu, Bing Liu. DOC: Deep Open Classification of Text Documents. Proceedings of 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP-2017, oral presentation short paper), September 7–11, 2017, Copenhagen, Denmark.
Geli Fei, Shuai Wang, and Bing Liu. 2016. Learning Cumulatively to Become More Knowledgeable. Proceedings of SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2016), August 13-17, San Francisco, USA.
Geli Fei, and Bing Liu. 2016. Breaking the Closed World Assumption in Text Classification. Proceedings of NAACL-HLT 2016 , June 12-17, San Diego, USA.

Created on July 15, 2020 by Bing Liu.