The Data Labeling Marketplace | Where AI Builders and AI Trainers Connect to Build the Future | Find, hire, & securely pay data labelers for any annotation toolopentrain.ai Seattle, WashingtonJoined September 2023
At this point I feel like we understand pretty well what's going on with LLMs:
- Outputs are roughly equivalent to kernel smoothing over positional embeddings (arxiv.org/pdf/1908.11775…)
- The learned computation model is *probably* bounded by RASP-L (arxiv.org/pdf/2310.16028…)
-…
The perfect quote to describe LLMs can be found in a 1946 Jean Cocteau movie -- "Réfléchissez pour moi, je réfléchirai pour vous" (think for me, I will reflect for you).
What you get from the model is always a reflection of the training data you put in -- itself a by-product of…
Grok-1 by @xai utilized "AI Tutors", or human subject matter experts to create custom training data & provide RLHF.
We're making this easy for anybody to do this with OpenTrain.ai: The data labeling marketplace to find, hire, & pay training data experts for ANY data…
🔥Excited to introduce LMSYS-Chat-1M, a large-scale dataset of 1M real-world conversations with 25 cutting-edge LLMs!
This dataset, collected from chat.lmsys.org, offers insights into user interactions with LLMs and intriguing use cases.
Link: huggingface.co/datasets/lmsys…
An in-depth look at RLHF by @natolambert from @huggingface. The need for high-quality, task-specific data in RLHF is crucial. With OpenTrainAI, you can find, hire, & pay the human experts essential for responsible and effective RLHF. Post your job today! #RLHF#MachineLearning
An in-depth look at RLHF by @natolambert from @huggingface. The need for high-quality, task-specific data in RLHF is crucial. With OpenTrainAI, you can find, hire, & pay the human experts essential for responsible and effective RLHF. Post your job today! #RLHF#MachineLearning
This is the way to unlock the next trillion high-quality tokens, currently frozen in textbook pixels that are not LLM-ready.
Nougat: an open-source OCR model that accurately scans books with heavy math/scientific notations. It's ages ahead of other open OCR options. Meta is…
1 Followers 64 FollowingThe largest and most complete list of AI data annotation, labeling, and services companies on the internet: https://t.co/Gp6Gc4D4Wt
212 Followers 2K FollowingBD @KGeN_IO | Gamer & Nerd
There is a group of people that believes, web3 gaming has no future.
I DO NOT associate with that group.
44 Followers 292 FollowingData Annotator | Specializing in Machine Learning Annotation
for Large Language Models | Expert in Accurate & Efficient Data
Labeling for AI
534 Followers 7K FollowingTechnology • Philosophy • Health • Space-faring • History • engineer/scientist • Building https://t.co/9436E2wmt8 - AI fun just for you
2 Followers 188 FollowingExperienced Senior Full Stack and Web3 Engineer with a proven track record of developing innovative and impactful web applications by using React.js, Next.js, A
32 Followers 136 FollowingNeed data for your AI? Sure, let random clickers label your brain scans. What could go wrong?🤷
I talk about why 99% of the industry gets AI datasets wrong
7K Followers 3K FollowingScience & technology enthusiast. On my Substack I write about metascience, AI, & other topics. Leave me anonymous feedback here: https://t.co/LQ5eZWwDst
55K Followers 1K FollowingWe build fresher maps for humanity.
Join a decentralized global community of mappers. Earn rewards. Change the world. Built on @solana, the home of #DePIN.
102K Followers 28 FollowingBuild AI agents over your documents
Github: https://t.co/HC19j7veGE
Docs: https://t.co/QInqg2yMCJ
LlamaCloud: https://t.co/yQGTiRSfFL
19K Followers 9K FollowingOn the quest to understand the fundamental mathematics of intelligence and of the universe with curiosity. https://t.co/mMchI2d4pg Upskilling @StanfordOnline
5K Followers 880 FollowingEvery age, it seems, is tainted by the greed of men. Rubbish to one such as I, devoid of all worldly wants. — I work on HPC and making AI run faster.
327K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
6K Followers 1K FollowingUBDC is a national research centre and data service providing expertise, training, extensive data collections and data tools. Funded by @ESRC & @UofGlasgow
18K Followers 4K FollowingPremier training ground for data scientists. We provide accelerated programs in Data Analytics & Visualization, Machine Learning, Big Data, and Deep Learning.
4K Followers 1 Followingcurations to pass into the world of wonder, self-reflection, & aliveness
🕳️🐇 curator @patriciamou_
🆕 book released: https://t.co/cYKhXKZNZf