Still having some free 4090 cards in hand, can help to run some experiments or do some paper works. Just print my name (any position) would be OK, DM me if needed :) submitted by /u/ImplementFew2472 [link] [comments]
Can anyone help me with the Optical flow for video classification. For eg. Human Activity Classification. I didnt find any tutorials from scratch for code all I just found were papers. submitted by /u/V1P3R_KN07 [link] [comments]
I seem to have stumbled upon a problem that i can't google my way out of. [MY TRAINING DATA] I have a dataset of bunch of sequential events. each event has 30-40 attributes, including the timestamp the event occured. user 1: Event 1 > Event 2 > Event 3 user 2: Event 1 > Event 2 > Event 3 > Event 4 > Event 5 user 3: Event 1 .......
I want the create a custom environment and do benchmarks on it using varius MCTS algorithms from LightZero https://github.com/opendilab/LightZero . However, I find the implementation confusing and complicated, does somenone have a clean extension of this algorithm that defines its own environment and runs varius algorithms on it ? submitted by ...
Sharing a video from my YT channel that breaks down the new KAN paper. It goes into all the core concepts required to understand the paper - the Kolmogorov Arnold Representation Theorem, Splines, MLPs, comparisons between MLPs and KANs, challenges ahead, and highlights some of the amazing properties/results of KANs like continual learning, sparsification,...
What are your best guesses on how it works (training and architecture) vs. the typical VL formula of pretrained vision encoder + pretrained LLM -> fine-tune with multimodal tasks? E.g. Is it fully mixed modality pre-training the entire system? Does model embed all modalities into a shared space for prediction? Does the system "self-select" the modality...
Създайте своя емисия с новини
Готови ли сте да опитате?
Стартирайте 14-дневен пробен период, не се изисква кредитна карта.