XCENA Raises $135M Series B to Build AI Inference Inside Memory, Not GPUs

$135M Series B

May 30, 2026 3 min read Business Wire Qualified Weak

Tech Jacks Solutions AI News Coverage

South Korean AI chip startup XCENA has closed a $135M Series B co-led by Atinum Investment and IMM Investment, valuing the company at approximately $570M post-money, according to a Business Wire announcement dated May 29, 2026. The company's MX1 chip targets a specific problem: LLM inference slows down not because GPUs run out of processing power, but because memory can't keep up with it.

ai-infrastructure-news xcena cxl-memory ai-chip-funding ai-startup-funding-news hbm inference-optimization south-korea-ai

XCENA post-money valuation, $570M

Key Takeaways

XCENA closed a $135M Series B at a $570M post-money valuation, all figures sourced from the company's Business Wire press release, pending URL resolution
The MX1 computational memory controller integrates RISC-V cores with up to 2 TB of DDR5 DRAM via CXL 3.x, targeting LLM decode-phase memory bandwidth as the primary inference bottleneck, per XCENA
XCENA claims a 10x inference server footprint reduction; this figure is vendor-stated and not independently verified
Mass production on Samsung 4nm targeted for late 2026; commercial revenue expected 2027, both company roadmap disclosures, not independently confirmed
The investment follows SK Hynix iHBM and Micron's Anthropic participation, suggesting a broadening conviction in memory-layer AI infrastructure plays

Funding Round

$135M (KRW 202B)

CompanyXCENA

RoundSeries B

Lead InvestorsAtinum Investment, IMM Investment (co-leads); SBI Investment, Mirae Asset Capital, Korea Development Bank, KDB Capital + 16 Korean institutional investors

Valuation$570M post-money

SectorAI Hardware / Inference Optimization

The bottleneck isn’t the GPU.

That’s the thesis behind XCENA’s $135M Series B. LLM inference during the decode phase, the part where the model generates each token of a response, is constrained primarily by memory bandwidth, not compute throughput. Expensive GPUs sit idle waiting for data. XCENA’s architecture moves the computation inside the memory rather than shuttling data to a processor, according to the company’s announcement.

The round was co-led by Atinum Investment and IMM Investment and included SBI Investment, Mirae Asset Capital, Korea Development Bank, KDB Capital, and 16 additional Korean institutional investors, according to the company’s Business Wire announcement. The $570M post-money valuation reflects $135M raised on approximately $435M pre-money. XCENA’s cumulative funding now stands at $185M.

The product is the MX1 computational memory controller. According to XCENA, the MX1 integrates thousands of RISC-V CPU cores directly alongside up to 2 TB of DDR5 DRAM, connected via CXL 3.x, the interconnect protocol designed to allow memory and compute resources to communicate across different chips and vendors. XCENA claims the architecture can reduce AI inference server footprint requirements by up to 10x, though this figure hasn’t been independently verified. The company plans to mass-produce MX1 on Samsung’s 4nm process in late 2026, with commercial revenue expected to begin in 2027, according to its announcement.

All figures in this brief originate from the company’s press release announcement. Source URLs are currently unresolved; the Business Wire press release (release ID 20260529005112) is the primary source pending URL confirmation.

The memory-bandwidth bottleneck thesis itself isn’t XCENA’s invention. It’s consistent with published AI inference research and has driven a wave of infrastructure investment this year, from SK Hynix’s iHBM announcement to Micron and Samsung’s participation in Anthropic’s Series H as strategic infrastructure partners, covered in ‘s infrastructure brief. XCENA’s bet is that solving the bottleneck happens at the controller layer, below HBM and above the CPU, by embedding computation directly in commodity DDR5 memory at CXL scale.

XCENA was founded by former Samsung Electronics and SK Hynix design executives, according to the company’s own materials, and has not been independently verified. If accurate, it’s meaningful context: the people who built the memory architecture are betting that the memory layer is where the inference optimization value gets captured.

What to Watch

Samsung 4nm production schedule, MX1 tape-out and volume production confirmationQ4 2026

XCENA enterprise design-win announcements, first external customer validation of 10x footprint claimQ1-Q2 2027

Business Wire source URL resolution, confirms all figures in this briefImmediate

The real story is

where in the inference stack the value ultimately concentrates. GPU vendors claim the compute layer. HBM manufacturers are capturing the high-bandwidth memory layer, SK Hynix and Micron both crossed $1T market caps on AI infrastructure demand. XCENA is making a separate bet: that a controller layer between the two is where the real efficiency gains live in LLM decode workloads. Twenty Korean institutional investors put $135M behind that thesis.

The 2027 revenue timeline is the first validation gate. If MX1 ships on Samsung 4nm in late 2026 and enterprise customers adopt it at scale, the footprint reduction claim becomes testable. Watch the Samsung 4nm production schedule and any XCENA design-win announcements in Q4 2026.

View Source

More Markets intelligence

View all Markets