Toward Ownership Understanding of Objects:
Active Question Generation with Large Language Model and Probabilistic Generative Model

Saki Hashimoto¹, Shoichi Hasegawa¹, Tomochika Ishikawa¹, Akira Taniguchi²,
Yoshinobu Hagiwara^3,4, Lotfi El Hafi⁴, Tadahiro Taniguchi^4,5

¹Graduate School of Information Science and Engineering, Ritsumeikan University
²College of Information Science and Engineering, Ritsumeikan University
³Faculty of Science and Engineering, Soka University
⁴Research Organization of Science and Technology, Ritsumeikan University
⁵Graduate School of Informatics, Kyoto University

Paper

Abstract

Robots operating in domestic and office environments must understand object ownership to correctly execute instructions such as “Bring me my cup.” However, ownership cannot be reliably inferred from visual features alone. To address this gap, we propose Active Ownership Learning (ActOwL), a framework that enables robots to actively generate and ask ownership-related questions to users. ActOwL employs a probabilistic generative model to select questions that maximize information gain, thereby acquiring ownership knowledge to improve learning efficiency. Additionally, by leveraging commonsense knowledge from Large Language Models (LLM), objects are pre-classified as either shared or owned, and only owned objects are targeted for questioning. Through experiments in a simulated home environment and a real-world laboratory setting, ActOwL achieved significantly higher ownership clustering accuracy with fewer questions than baseline methods.

Overview

Overview of this study: (a) Without ownership knowledge, the robot cannot follow instructions containing owner names. (b) The robot generates questions for the user. (c) The probabilistic generative model is updated to predict object ownership based on the user’s answers, enabling ownership learning.

Proposed Method: Active Ownership Learning (ActOwL)

Overview of the proposed method: (a) The robot explores the environment to obtain object positions and attributes, training an initial ownership model. (b) Objects are classified as owned or shared using commonsense knowledge from the LLM. (c) Classification results are treated as pseudo-answers to update the ownership distribution. (d) For owned objects, the robot computes IG, selects the most informative one, generates a question via LLM, and updates the model based on the user’s answer.

Shared / Owned Classification Prompt

Before actively asking questions, the robot estimates whether each object is likely shared or individually owned using commonsense knowledge from a Large Language Model (LLM). This step reduces unnecessary questions by excluding obviously shared objects.

Prompt Example


You are a household assistant robot.

Given the following object name, determine whether the object is
more likely to be a shared object or an individually owned object
in a typical household.

Object: "red mug"

Answer with one of the following labels:
- Shared
- Owned

Provide only the label.

The classification result is treated as a prior (pseudo-answer) in the probabilistic ownership model. Objects classified as shared are excluded from active questioning.

Probabilistic Generative Model for Ownership

Ownership is modeled as a latent variable inferred from object attributes, spatial information, and user responses. The model maintains uncertainty and is updated online through interaction.

The probabilistic generative model integrates:

Object attributes and locations
User identity
User answers to ownership-related questions

After each interaction, the posterior distribution over ownership is updated using Bayesian inference, enabling incremental ownership learning.

Active Question Generation Prompt

To efficiently reduce ownership uncertainty, the robot selects an object that maximizes information gain (IG) and generates a natural-language question using an LLM.

Prompt Example


You are a service robot interacting with a user.

Your goal is to determine the owner of an object.
Generate a polite and natural question to ask the user
about the ownership of the following object.

Object: "red mug"

Constraints:
- The question should be short and clear.
- Ask only one question.
- Do not include explanations.

The LLM is used only for language generation, while the selection of the object to ask about is determined by the probabilistic model based on information gain.

User Recognition Prompt

To correctly update ownership beliefs, the robot must identify which user is responding to the question. User recognition conditions the probabilistic update of ownership.

Prompt Example


You are a robot interacting with multiple users.

Based on the following information, identify the user
who is currently speaking.

Input:
- Detected face ID: User_2
- Voice direction: front
- Current interaction context: ownership question

Output the user ID only.

The identified user ID is used to update the ownership distribution, enabling multi-user ownership learning in shared environments.

BibTeX


@article{hashimoto2026actowl,
  author  = {Hashimoto, Saki and Hasegawa, Shoichi and Ishikawa, Tomochika and Taniguchi, Akira and Hagiwara, Yoshinobu and El Hafi, Lotfi and Taniguchi, Tadahiro},
  title   = {Toward Ownership Understanding of Objects: Active Question Generation with Large Language Model and Probabilistic Generative Model},
  journal = {Artificial Life and Robotics},
  year    = {2026},
  note    = {under review}
}

Laboratory Information

Funding

This work was supported by JSPS KAKENHI Grants-in-Aid for Scientific Research (Grant Numbers JP23K16975, JP25K15292) and JST Moonshot Research & Development Program (Grant Number JPMJMS2011).