Like many companies, we're starting to develop applications with generative AI, primarily using the RAG architecture because frequent data changes make fine-tuning less effective. We're considering GPT-3.5, GPT-4, and Gemini, but want options beyond these due to cloud dependency and cost. We're planning a multi-model approach, selecting LLMs based on context size and other factors. Can you suggest a decision-making framework for evaluating LLMs?
Product Associate in Software · a year ago
Hi!
I can't answer your question directly, as I've only used the GPT family from OpenAI myself. However, just for completeness, I'd add some good options to your list:
- GPT-family LLMs via Azure OpenAI Service: provide similar value to the direct OpenAI API, but with more control, e.g. over where the data is processed geographically;
- Claude API from Anthropic: I haven't used it yet but plan to check it out, as the quality seems to be on par with OpenAI's models.
Hugging Face (https://huggingface.co) is something like GitHub for LLMs, datasets, etc. It will give your team the opportunity to test a variety of LLMs, both open source and proprietary, including OpenAI's, Microsoft's, and Google's models. Through my research I've found that while some models may not currently be the best on the market, they show great potential to become very good in the near future, particularly in the healthtech and fintech markets.
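As for the decision-making framework itself: one common way to structure it is a weighted scoring matrix over the criteria you mentioned (context size, cost, cloud dependency). Here is a minimal sketch in Python; note that all model names, numbers, and weights below are hypothetical placeholders I made up for illustration, so you'd substitute your own measured values and priorities.

```python
# Illustrative weighted-scoring framework for comparing LLM candidates.
# Every model name, metric value, and weight here is a hypothetical
# placeholder, not real benchmark data.

CANDIDATES = {
    # context_k: context window in thousands of tokens (higher is better)
    # cost: relative cost per 1k tokens (lower is better)
    # self_host: 1.0 if the model can run on your own infrastructure
    "model-a": {"context_k": 8,  "cost": 0.03, "self_host": 0.0},
    "model-b": {"context_k": 32, "cost": 0.06, "self_host": 0.0},
    "model-c": {"context_k": 4,  "cost": 0.01, "self_host": 1.0},
}

# Weights express how much each criterion matters to your team; they sum to 1.
WEIGHTS = {"context_k": 0.4, "cost": 0.35, "self_host": 0.25}


def normalize(values, invert=False):
    """Scale a column of raw values to [0, 1]; invert when lower is better."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1.0] * len(values)
    scaled = [(v - lo) / (hi - lo) for v in values]
    return [1.0 - s for s in scaled] if invert else scaled


def rank_models(candidates, weights):
    """Return (name, score) pairs sorted from best to worst."""
    names = list(candidates)
    scores = {name: 0.0 for name in names}
    for criterion, weight in weights.items():
        invert = criterion == "cost"  # cheaper is better
        column = normalize([candidates[n][criterion] for n in names], invert)
        for name, value in zip(names, column):
            scores[name] += weight * value
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    for name, score in rank_models(CANDIDATES, WEIGHTS):
        print(f"{name}: {score:.2f}")
```

The nice part of this approach is that the debate moves from "which model is best" to "which criteria matter and by how much", which is usually easier for a team to agree on; you can also add rows for quality scores from your own RAG evaluation set as you gather them.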