“可预测的”非凸在线点对学习的遗憾界

打印
收藏

收藏成功

微博 QQ空间微信

打开文本图片集

中图分类号： TP301. 5 文献标志码：ADOI： 10.13705/j . issn. 1671-6841. 2023199

Abstract： Online pairwise learning is a machine learning model in which the loss functions depend on a pair of instances. Generalization is an important aspect of online pairwise learning theory research. Most of the existing works on online pairwise learning used adversarial loss functions and provided regret bounds only with convex loss functions. However， convexity was not typically applicable in practical scenarios. For non-convex online pairwise learning， the regret bound of online pairwise learning with a " predictable" loss function based on stability analysis was provided and the corresponding stability analysis was conducted. Through the relationship between stability and regret， a common way to measure the generalization ability of online pairwise learning， the regret bound was established with a " predictable" non-convex loss function. It was proved that when the learner obtained an offline oracle， " predictable" non-convex generalized online pairwise learning reached the regret bound of O（T-3/2） . This study enriched the theoretical research on non-convex online pairwise learning and was superior to the existing theoretical guarantees.

Key words： online pairwise learning; non-convex; stability; regret bounds; offline optimization oracle

0 引言

在线点对学习主要应用于排序[ 1-3] 、度量学习[ 4] 等。（剩余11777字）

试读结束

购买全文6.00元下一篇基于列生成和遗传算法的三阶段齐切割问题

郑州大学学报（理学版）

2025年04期

¥4.00/本