A little less than a year ago, I joined the awesome Cohere team. The company trains massive language models (both GPT-like and BERT-like) and offers them as an API (which also supports finetuning). Its founders include Google Brain alumni, among them co-authors of the original Transformer paper. It’s a fascinating role where I get to help companies and developers put these massive models to work solving real-world problems.
I love that I get to share some of the intuitions developers need to start problem-solving with these models. Even though I’ve been working very closely with pretrained Transformers for the past several years (for this blog and in developing Ecco), I’m enjoying the convenience of problem-solving with managed language models, as it frees me from the constraints of model loading/deployment and memory/GPU management.
These are some of the articles I wrote and collaborated on with colleagues over the last few months:
Intro to Large Language Models with Cohere
This is a high-level introduction to large language models for people who are new to them. It establishes the difference between generative (GPT-like) and representation (BERT-like) models and gives example use cases for each.
This is one of the first articles I got to write. It’s extracted from a much larger document that I wrote to explore some of the visual language to use in explaining the application of these models.
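To give a flavor of that distinction: a generative model continues a text prompt, while a representation model turns text into a vector you can compute with. Here is a minimal sketch of the two interfaces, assuming the Cohere Python SDK; the prompt, placeholder API key, and parameter values are mine, not from the article.

```python
# A minimal sketch of the two model families behind the Cohere API.
# Assumes the Cohere Python SDK (pip install cohere) and a valid API key.
import cohere

co = cohere.Client("YOUR_API_KEY")

# Generative (GPT-like): the model continues the prompt with new text.
generation = co.generate(prompt="The capital of France is", max_tokens=5)
print(generation.generations[0].text)

# Representation (BERT-like): the model maps text to an embedding vector.
embedding = co.embed(texts=["The capital of France is Paris."])
print(len(embedding.embeddings[0]))  # dimensionality of the vector
```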
A visual guide to prompt engineering
Massive GPT models open the door to a new way of programming. If you structure the input text in the right way, you can get useful (and often fascinating) results for a lot of tasks (e.g., text classification, copywriting, summarization, etc.).
This article visually demonstrates four principles for creating prompts effectively.
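As a rough illustration of the kind of structure the article discusses (this example is mine, not the article’s): a few-shot classification prompt packs a task description and a handful of solved examples into the input, then leaves the final line for the model to complete.

```python
# A minimal sketch of a few-shot classification prompt. The reviews and
# labels are made up; the co.generate call assumes the Cohere Python SDK.
import cohere

co = cohere.Client("YOUR_API_KEY")

prompt = """Classify the sentiment of each movie review as Positive or Negative.

Review: I loved every minute of it.
Sentiment: Positive

Review: A complete waste of two hours.
Sentiment: Negative

Review: The acting was superb and the story kept me hooked.
Sentiment:"""

response = co.generate(prompt=prompt, max_tokens=5)
print(response.generations[0].text.strip())  # expected completion: "Positive"
```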
Text Summarization
This is a walkthrough of creating a simple summarization system. It links to a Jupyter notebook which includes the code to start experimenting with text generation and summarization.
The end of this notebook shows an important idea I want to spend more time on in the future: how to rank/filter/select the best output from among multiple generations.
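A minimal sketch of that idea (the prompt format and the score_candidate heuristic are illustrative stand-ins, not the notebook’s actual code): sample several summaries at a nonzero temperature, then keep the one that scores best.

```python
# Generate several candidate summaries, then pick one. Assumes the Cohere
# Python SDK; score_candidate is a hypothetical placeholder heuristic.
import cohere

co = cohere.Client("YOUR_API_KEY")

prompt = """Summarize the passage in one sentence.

Passage: <the text to summarize>
Summary:"""

def score_candidate(text):
    # Hypothetical heuristic: prefer shorter candidates that end cleanly.
    return -len(text) if text.strip().endswith(".") else -float("inf")

candidates = [
    co.generate(prompt=prompt, max_tokens=50, temperature=0.8).generations[0].text
    for _ in range(5)
]
print(max(candidates, key=score_candidate))
```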
Semantic Search
Semantic search has to be one of the most exciting applications of sentence embedding models. This tutorial implements a “similar questions” functionality using sentence embeddings and a vector search library.
The vector search library used here is Annoy, from Spotify. There are a bunch of others out there; Faiss is widely used, and I experiment with PyNNDescent as well.
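Here is a minimal sketch of the indexing and querying steps with Annoy. Random vectors stand in for the sentence embeddings, which in the tutorial come from an embedding model; the dimensionality and tree count are illustrative choices.

```python
# Build an Annoy index over embedding vectors, then query nearest neighbors.
import numpy as np
from annoy import AnnoyIndex

dim = 1024  # dimensionality of the embedding vectors (model-dependent)
embeddings = np.random.default_rng(0).normal(size=(1000, dim))

index = AnnoyIndex(dim, "angular")  # angular distance ~ cosine similarity
for i, vector in enumerate(embeddings):
    index.add_item(i, vector.tolist())
index.build(10)  # 10 trees; more trees = better accuracy, larger index

query = embeddings[0].tolist()
similar_ids, distances = index.get_nns_by_vector(query, 5, include_distances=True)
print(similar_ids)  # ids of the 5 most similar "questions"
```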
Finetuning Representation Models
Controlling Generation with top-k & top-p
This one is a little bit more technical. It explains the parameters you tweak to adjust a GPT’s decoding strategy — the method with which the system picks output tokens.
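To make the mechanics concrete, here is a small sketch (mine, not the article’s) of combining the two: top-k first restricts sampling to the k most likely tokens, then top-p (nucleus sampling) keeps only the smallest set of those whose probabilities add up to p.

```python
import numpy as np

def sample_top_k_top_p(logits, k=50, p=0.9, seed=None):
    """Sample a token id from `logits` after top-k, then top-p, filtering."""
    rng = np.random.default_rng(seed)

    # Top-k: keep only the k highest-scoring tokens, sorted high to low.
    top_ids = np.argsort(logits)[::-1][:k]
    probs = np.exp(logits[top_ids] - logits[top_ids].max())
    probs /= probs.sum()

    # Top-p: keep the smallest prefix whose cumulative probability
    # reaches p, then renormalize over the survivors.
    keep = np.searchsorted(np.cumsum(probs), p) + 1
    probs = probs[:keep] / probs[:keep].sum()

    return rng.choice(top_ids[:keep], p=probs)

# Toy vocabulary of 5 tokens:
print(sample_top_k_top_p(np.array([2.0, 1.0, 0.5, 0.1, -1.0]), k=3, p=0.9))
```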
Text Classification Using Embeddings
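The gist of the approach, as a minimal sketch: embed each text, then train an off-the-shelf classifier on the embedding vectors. Random vectors stand in here for embeddings that would come from an embedding model.

```python
# Train a simple classifier on top of (stand-in) sentence embeddings.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
train_embeddings = rng.normal(size=(100, 1024))  # stand-in embeddings
train_labels = rng.integers(0, 2, size=100)      # stand-in binary labels

classifier = LogisticRegression(max_iter=1000)
classifier.fit(train_embeddings, train_labels)
print(classifier.predict(train_embeddings[:5]))
```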
You can find these and upcoming articles in the Cohere docs and notebooks repo. I have quite a number of experiments and interesting workflows I’d love to share in the coming weeks. So stay tuned!