• soboleffspaces Profile Picture

    Boris Sobolev @soboleffspaces

    a month ago

    Thank you Elias for pushing our thinking this direction, and please pass my admiration to Drago for his productivity… I’m wondering why did you use another LLM as the ground truth here? it just adds another abstraction… I mean, wouldn’t it be more convincing to estimate distribution via some cross-classification table, like it’s done in epidemiology (sorry, couldn’t resist 😊) and then check it against the performance of llms, btw, pls consider the seer database, seer.cancer.gov/data/access.ht… the best population-based registry in the world… health surveys data are laced with selection bias, and rarely let to even descriptional evidence… but SEER is simply the best — curated, studied, and recognized by cancer epidemiologists

    eliasbareinboim Profile Picture

    Elias Bareinboim @eliasbareinboim

    a month ago

    Thank you Elias for pushing our thinking this direction, and please pass my admiration to Drago for his productivity… I’m wondering why did you use another LLM as the ground truth here? it just adds another abstraction… I mean, wouldn’t it be more convincing to estimate distribution via some cross-classification table, like it’s done in epidemiology (sorry, couldn’t resist 😊) and then check it against the performance of llms, btw, pls consider the seer database, seer.cancer.gov/data/access.ht… the best population-based registry in the world… health surveys data are laced with selection bias, and rarely let to even descriptional evidence… but SEER is simply the best — curated, studied, and recognized by cancer epidemiologists

    1 0 5 3K 0

    1 0 3 996 1
  • eliasbareinboim Profile Picture

    Elias Bareinboim @eliasbareinboim

    a month ago

    @soboleffspaces Thank you, Boris! We didn’t use other LLMs as ground truth, but the datasets listed on p. 5: causalai.net/r136.pdf. We’ve been looking for additional sources of ground truth and enlarging the benchmark, so the link is appreciated. Of course, open to collaborations!

    1 0 2 347 0
  • Download Image
    • Privacy
    • Term and Conditions
    • About
    • Contact Us
    • TwStalker is not affiliated with X™. All Rights Reserved. 2024 instalker.org

    twitter web viewer x profile viewer bayigram.com instagram takipçi satın al instagram takipçi hilesi twitter takipçi satın al tiktok takipçi satın al tiktok beğeni satın al tiktok izlenme satın al beğeni satın al instagram beğeni satın al youtube abone satın al youtube izlenme satın al sosyalgram takipçi satın al instagram ücretsiz takipçi twitter takipçi satın al tiktok takipçi satın al tiktok beğeni satın al tiktok izlenme satın al beğeni satın al instagram beğeni satın al youtube abone satın al youtube izlenme satın al metin2 metin2 wiki metin2 ep metin2 dragon coins metin2 forum metin2 board popigram instagram takipçi satın al takipçi hilesi twitter takipçi satın al tiktok takipçi satın al tiktok beğeni satın al tiktok izlenme satın al beğeni satın al instagram beğeni satın al youtube abone satın al youtube izlenme satın al buyfans buy instagram followers buy instagram likes buy instagram views buy tiktok followers buy tiktok likes buy tiktok views buy twitter followers buy telegram members Buy Youtube Subscribers Buy Youtube Views Buy Youtube Likes forstalk postegro web postegro x profile viewer