Unveiling the Power of Self-Attention for Shipping Cost Prediction: The Rate Card Transformer

Cited: 0
Authors
Sreekar, P. Aditya [1 ]
Verma, Sahil [1 ]
Madhavan, Varun [1 ,2 ]
Persad, Abhishek [1 ]
Affiliations
[1] Amazon, Hyderabad, Telangana, India
[2] Indian Inst Technol, Kharagpur, W Bengal, India
Source
Asian Conference on Machine Learning, Vol. 222, 2023
Keywords
DOI
Not available
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline codes
081104; 0812; 0835; 1405;
Abstract
Amazon ships billions of packages to its customers annually within the United States. The shipping costs of these packages are used on the day of shipping (day 0) to estimate the profitability of sales. Downstream systems utilize these day-0 profitability estimates to make financial decisions, such as setting pricing strategies and delisting loss-making products. However, obtaining accurate shipping cost estimates on day 0 is complex for reasons such as delays in carrier invoicing or fixed cost components being recorded at a monthly cadence. Inaccurate shipping cost estimates can lead to poor decisions, such as pricing items too low or too high, or promoting the wrong products to customers. Current solutions for estimating shipping costs on day 0 rely on tree-based models that require extensive manual engineering effort. In this study, we propose a novel architecture called the Rate Card Transformer (RCT) that uses self-attention to encode all package shipping information, such as package attributes, carrier information, and route plan. Unlike other transformer-based tabular models, the RCT can encode a variable-length list of one-to-many relations of a shipment, allowing it to capture more information about a shipment; for example, the RCT can encode the properties of all products in a package. Our results demonstrate that cost predictions made by the RCT have 28.82% less error than a tree-based GBDT model. Moreover, the RCT outperforms the state-of-the-art transformer-based tabular model, FTTransformer, by 6.08%. We also illustrate that the RCT learns a generalized manifold of the rate card that can improve the performance of tree-based models.
Pages: 13
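
The abstract above describes the core idea: package-level attributes and a variable-length list of per-item features are encoded as tokens and mixed with self-attention before a regression head predicts the shipping cost. The sketch below is a minimal, hypothetical PyTorch illustration of that idea, not the authors' implementation; the class name, feature counts, and hyperparameters (e.g. RateCardTransformerSketch, num_item_fields, d_model) are assumptions made for illustration only.

```python
# Hypothetical sketch of an RCT-style tabular model: package fields and a
# variable-length list of item features become tokens, self-attention mixes
# them, and a regression head predicts the shipping cost from a CLS token.
import torch
import torch.nn as nn


class RateCardTransformerSketch(nn.Module):
    def __init__(self, num_item_fields=4, d_model=64, n_heads=4, n_layers=2):
        super().__init__()
        # Linear "tokenizers" turn raw numeric features into d_model-dim tokens
        # (categorical fields would use nn.Embedding; omitted for brevity).
        self.package_proj = nn.Linear(1, d_model)            # one token per package field
        self.item_proj = nn.Linear(num_item_fields, d_model)  # one token per item
        self.cls_token = nn.Parameter(torch.zeros(1, 1, d_model))
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=n_layers)
        self.head = nn.Linear(d_model, 1)  # regress the shipping cost

    def forward(self, package_feats, item_feats, item_mask):
        # package_feats: (B, P)        one scalar per package-level field
        # item_feats:    (B, I, F)     F features for each of up to I items
        # item_mask:     (B, I) bool   True where the item slot is padding
        B = package_feats.size(0)
        pkg_tokens = self.package_proj(package_feats.unsqueeze(-1))  # (B, P, d)
        item_tokens = self.item_proj(item_feats)                     # (B, I, d)
        cls = self.cls_token.expand(B, -1, -1)                       # (B, 1, d)
        tokens = torch.cat([cls, pkg_tokens, item_tokens], dim=1)
        # CLS and package tokens are always attended to; only items are padded.
        pad = torch.zeros(B, 1 + pkg_tokens.size(1), dtype=torch.bool,
                          device=tokens.device)
        mask = torch.cat([pad, item_mask], dim=1)
        encoded = self.encoder(tokens, src_key_padding_mask=mask)
        return self.head(encoded[:, 0]).squeeze(-1)  # predict from CLS token


# Example: 2 shipments, 8 package fields, up to 3 items with 4 fields each.
model = RateCardTransformerSketch()
pkg = torch.randn(2, 8)
items = torch.randn(2, 3, 4)
mask = torch.tensor([[False, False, True],   # shipment 0 has 2 items
                     [False, True, True]])   # shipment 1 has 1 item
cost = model(pkg, items, mask)               # shape: (2,)
```

The padding mask is what lets one batch mix shipments with different numbers of items, which is the one-to-many property the abstract highlights; the learned CLS token gives the regression head a fixed-size summary of the whole token set.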