共 50 条
- [42] Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models COMPUTER VISION - ECCV 2024, PT LXXIII, 2025, 15131 : 174 - 189
- [45] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 2015 - 2040
- [46] "Turning right"? An experimental study on the political value shift in large language models HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2025, 12 (01):
- [48] Unpacking the Ethical Value Alignment in Big Models Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (09): : 1926 - 1945
- [49] Large language models can infer psychological dispositions of social media users PNAS NEXUS, 2024, 3 (06):
- [50] Large language models to identify social determinants of health in electronic health records npj Digital Medicine, 7