Need help with RAG chatbot [Project]

I'm building a RAG chatbot that answers questions using context from the documents uploaded to the database it is connected to. I'm now trying to add a feature where the user can type a hash (#) to point the bot at a specific document in the database and ask questions about that document only. Please help me on how to implement...
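One possible way to approach the hash feature described above is to pull the `#name` token out of the message and use it to restrict which chunks the retriever may see. The snippet below is only a minimal sketch under assumptions not stated in the post: each stored chunk is assumed to carry a document name, and `Chunk`, `parse_query`, and `select_chunks` are hypothetical names for illustration, not part of any particular RAG library.

```python
import re
from dataclasses import dataclass


@dataclass
class Chunk:
    """Hypothetical stored chunk: the source document name plus its text."""
    doc_name: str
    text: str


# Assumption: a document reference looks like "#name", where the name may
# contain letters, digits, underscores, dots, and dashes.
HASH_TAG = re.compile(r"#([\w.\-]+)")


def parse_query(query):
    """Split a user message into (doc_name or None, question text)."""
    match = HASH_TAG.search(query)
    if match is None:
        return None, query.strip()
    # Remove the "#name" token and collapse leftover whitespace.
    remainder = query[:match.start()] + query[match.end():]
    return match.group(1), " ".join(remainder.split())


def select_chunks(chunks, doc_name):
    """Keep only chunks from the named document; keep all if no name given."""
    if doc_name is None:
        return list(chunks)
    wanted = doc_name.lower()
    return [c for c in chunks if c.doc_name.lower() == wanted]


if __name__ == "__main__":
    store = [
        Chunk("report.pdf", "Revenue grew in Q1."),
        Chunk("notes.txt", "Meeting moved to Friday."),
    ]
    name, question = parse_query("#report.pdf what happened to revenue?")
    candidates = select_chunks(store, name)
    print(name, "|", question, "|", [c.text for c in candidates])
```

The filtered chunks would then go through the usual similarity search. Many vector stores also accept a metadata filter at query time, which would avoid filtering in application code, but the exact parameter depends on the store in use.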

Tue May 14, 2024 12:42
[R] How Well Can Transformers Emulate In-context Newton's Method?

Paper: https://arxiv.org/abs/2403.03183 Code: https://anonymous.4open.science/r/transformer_higher_order-B80B/ Abstract: Transformer-based models have demonstrated remarkable in-context learning capabilities, prompting extensive research into its underlying mechanisms. Recent studies have suggested that Transformers can implement first-order optimization...

Tue May 14, 2024 12:42
[P] A Dataset for The Global Artificial Intelligence Championship Math 2024

Dataset and code: https://github.com/protagolabs/odyssey-math AGI Odyssey: https://www.agiodyssey.org Description: The Global Artificial Intelligence Championship (GAIC) Math 2024 presents a collection of 387 meticulously crafted math problems, curated by professional math problem writers from both universities and high schools. The compilation...

Tue May 14, 2024 12:42
[D] Language model for time series forecasting from Amazon

Time series forecasting is important for many industries, such as retail, energy, and finance. I have delivered many projects in this area with statistical models and deep learning models (LSTM, CNN), and it was always a challenge. With the rapid development in the language model space, I was wondering how LLM architectures could be used for forecasting, and while...

Tue May 14, 2024 12:42
[D] Machine Learning Foundations: A Case Study Approach

Hi there, I'm running into a bit of an issue with this course. It seems the libraries used, GraphLab and Turi Create, are outdated and no longer commonly used. Is there an alternative way to practice the concepts covered in the course? Ideally, I'd like to practice the lessons using more current libraries. submitted by /u/Lemikaa

Tue May 14, 2024 12:42
[D] Has anyone tried to implement KANs from scratch?

Recently I have been hearing a lot about this new architecture (Kolmogorov-Arnold Networks) that might bring a new revolution to the deep learning domain. For many years the MLP was the only architecture used to solve any problem with neural networks, so the announcement of this new architecture is definitely a breakthrough. Though many times...

Tue May 14, 2024 12:42
