[D] Looking for an easy-to-read research paper for a presentation

Hello, I have a reading assignment for my advanced machine learning course, and I must pick one paper to present in front of the teacher from one of these four ML conferences: AISTATS 2023, ICML 2023, NeurIPS 2023, or ICLR 2023. I am looking for a research paper that is relatively easy to read and not too technical. Do you have any recommendations that you found interesting...

Wed May 29, 2024 15:55
[Discussion] Is GPT short for Generative Pretraining or Generative Pretrained Transformers?

Hello everyone. Recently I read on Wikipedia that GPT is short for Generative Pretrained Transformers: https://en.wikipedia.org/wiki/Generative_pre-trained_transformer I have also seen some other places which also say this: https://medium.com/@anitakivindyo/what-are-generative-pre-trained-transformers-gpts-b37a8ad94400 https://aws.amazon.com/what-i...

Wed May 29, 2024 15:55
[D] Friday Oxen.ai Paper Club: Extracting Interpretable Features from Claude 3 Sonnet

Hear the paper that Hugging Face cofounder Thomas Wolf called "totally based" interpreted through the lens of Oxen.ai CEO and Master-of-Plain-Speak-Delving: Greg Schoeninger. Register: https://lu.ma/oxen Friday, 10:00 AM Pacific / 1:00 PM Eastern Time, on Zoom. Paper: https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html?s=09%2F/ Hey...

Wed May 29, 2024 15:55
[Project] Prompt Teacher - Free, educational tool teaching how to write effective LLM prompts

I'd like to share an educational prompt optimization tool called Prompt Teacher that I hope will be useful for the community :) Quickstart Guide 🚀 👉 Try the app directly without any setup: Prompt Teacher @ Hugging Face Spaces 🔍 Inspect the code: GitHub: pwenker/prompt_teacher Hugging Face Spaces: pwenker/prompt_teacher Metaprompts Overview 📜 Here are...

Wed May 29, 2024 15:55
[R] Tool Learning with Large Language Models: A Survey

PDF: https://arxiv.org/abs/2405.17935 GitHub: https://github.com/quchangle1/LLM-Tool-Survey Abstract: Recently, tool learning with large language models (LLMs) has emerged as a promising paradigm for augmenting the capabilities of LLMs to tackle highly complex problems. Despite growing attention and rapid advancements in this field, the existing literature...

Wed May 29, 2024 12:55
[D] Data Scientist does the task without data

Recently I was assigned a task to build a user purchase scoring system based on user interaction activities. However, the funny thing is that I don't have any data about user interactions with the product, so I surveyed the solutions of many parties and used my own hypotheses to create the features that I thought would be suitable for building a prediction...

Wed May 29, 2024 12:55
