Computation and Language

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural
  Language and Reinforcement Learning
Avatar
Jiahao Xu
0 views
Learning Composable Chains-of-Thought
Avatar
librarian
0 views
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken
  Language Understanding
Avatar
Alkis Koudounas
6 views
THiNK: Can Large Language Models Think-aloud?
Avatar
Yongan Yu
4 views
Do Large Language Models Excel in Complex Logical Reasoning with Formal
  Language?
Avatar
Jin Jiang
3 views
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent
  Systems
Avatar
librarian
5 views
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs
  via Reinforcement Learning
Avatar
librarian
7 views
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous
  Concept Space
Avatar
librarian
6 views
A Federated Splitting Framework for LLMs: Security, Efficiency, and
  Adaptability
Avatar
librarian
5 views
VerifyBench: Benchmarking Reference-based Reward Systems for Large
  Language Models
Avatar
librarian
5 views
BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information
  Retrieval
Avatar
Hervé Onguéné
13 views
Learning Dynamics in Continual Pre-Training for Large Language Models
Avatar
librarian
12 views
ComPO: Preference Alignment via Comparison Oracles
Avatar
librarian
12 views
Reasoning Models Don't Always Say What They Think
Avatar
Yanda Chen
17 views
Whisper-LM: Improving ASR Models with Language Models for Low-Resource
  Languages
Avatar
Hussein Kedir
23 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
경택 오
77 views
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Avatar
yorba
58 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Ilya Baimetov
259 views
A Pipeline For Discourse Circuits From CCG
Avatar
ScienceCast Board
210 views
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated
  Text
Avatar
Yael Flax
213 views
Meta-path Augmented Response Generation
Avatar
ScienceCast Board
199 views
CliNER 2.0: Accessible and Accurate Clinical Concept Extraction
Avatar
Sasa Pure
181 views
A Hybrid Architecture for Multi-Party Conversational Systems
Avatar
priaon-flag
189 views
Analyzing the Structure of Attention in a Transformer Language Model
Avatar
levymoshe16
205 views
Direct Neural Machine Translation with Task-level Mixture of Experts
  models
Avatar
Isidora Tourni
205 views
Transformers as Soft Reasoners over Language
Avatar
ScienceCast Board
212 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
ScienceCast Board
466 views
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Avatar
Xing Han
233 views
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in
  Closed-Source LLMs
Avatar
burke-atilla
195 views
Defending Against Authorship Identification Attacks
Avatar
hw56
202 views
A Content-Based Novelty Measure for Scholarly Publications: A Proof of
  Concept
Avatar
hw56
191 views
The Many Voices of Duying: Revisiting the Disputed Essays Between Lu Xun
  and Zhou Zuoren
Avatar
hw56
210 views