Perplexity

company

Verified

https://www.perplexity.ai/

AI & ML interests

None defined yet.

Recent Activity

bowang0911 updated a model about 20 hours ago

perplexity-ai/pplx-embed-v1-late-0.6b

bowang0911 updated a collection 4 days ago

bowang0911 published a model 5 days ago

perplexity-ai/pplx-embed-v1-late-0.6b

View all activity

Papers

RDMA Point-to-Point Communication for LLM Systems

View all Papers

updated a model about 20 hours ago

perplexity-ai/pplx-embed-v1-late-0.6b

Feature Extraction • 0.6B • Updated about 20 hours ago • 4.87k • 16

updated a collection 4 days ago

pplx-embed

Diffusion-Pretrained Dense and Contextual Embeddings • 8 items • Updated 4 days ago • 96

published a model 5 days ago

perplexity-ai/pplx-embed-v1-late-0.6b

Feature Extraction • 0.6B • Updated about 20 hours ago • 4.87k • 16

in perplexity-ai/pplx-embed-v1-4b about 2 months ago

Can't serve the model using TEI

#12 opened 3 months ago by

TomaszZietkiewicz

fix: add fast tokenizer

#13 opened about 2 months ago by

authored 2 papers about 2 months ago

FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving

Paper • 2501.01005 • Published Jan 2, 2025 • 2

RDMA Point-to-Point Communication for LLM Systems

Paper • 2510.27656 • Published Oct 31, 2025 • 8

authored 2 papers over 2 years ago

Punica: Multi-Tenant LoRA Serving

Paper • 2310.18547 • Published Oct 28, 2023 • 2

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Paper • 2310.19102 • Published Oct 29, 2023 • 11

authored a paper almost 3 years ago

FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learning

Paper • 2210.12873 • Published Oct 23, 2022