Skip to content
@stanford-futuredata

Future Data Systems

We are a CS research group building data-intensive systems

Popular repositories Loading

  1. ColBERT ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 2.9k 373

  2. macrobase macrobase Public

    MacroBase: A Search Engine for Fast Data

    Java 660 126

  3. ARES ARES Public

    Python 437 48

  4. noscope noscope Public

    Accelerating network inference over video

    Python 436 122

  5. sparser sparser Public

    Sparser: Raw Filtering for Faster Analytics over Raw Data

    C 430 55

  6. dawn-bench-entries dawn-bench-entries Public

    DAWNBench: An End-to-End Deep Learning Benchmark and Competition

    Python 260 74

Repositories

Showing 10 of 69 repositories
  • FrugalGPT Public

    FrugalGPT: better quality and lower cost for LLM applications

    stanford-futuredata/FrugalGPT’s past year of commit activity
    Python 172 Apache-2.0 17 2 0 Updated Sep 18, 2024
  • ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    stanford-futuredata/ColBERT’s past year of commit activity
    Python 2,880 MIT 373 71 19 Updated Sep 4, 2024
  • stk Public
    stanford-futuredata/stk’s past year of commit activity
    Python 83 Apache-2.0 17 3 1 Updated Aug 26, 2024
  • ARES Public
    stanford-futuredata/ARES’s past year of commit activity
    Python 437 Apache-2.0 48 9 2 Updated Aug 7, 2024
  • gavel Public

    Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

    stanford-futuredata/gavel’s past year of commit activity
    Jupyter Notebook 124 MIT 31 8 2 Updated Jul 25, 2024
  • InQuest Public

    Accelerating Aggregation Queries on Unstructured Streams of Data

    stanford-futuredata/InQuest’s past year of commit activity
    Python 7 2 1 0 Updated Apr 18, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    stanford-futuredata/Megatron-LM’s past year of commit activity
    Python 31 2,317 0 2 Updated Jan 19, 2024
  • tasti Public

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data (SIGMOD 2022)

    stanford-futuredata/tasti’s past year of commit activity
    Python 14 5 0 0 Updated Jan 17, 2024
  • omg Public
    stanford-futuredata/omg’s past year of commit activity
    Python 20 Apache-2.0 3 0 0 Updated Sep 20, 2023
  • abae Public

    Accelerating Approximate Aggregation Queries with Expensive Predicates (VLDB 21)

    stanford-futuredata/abae’s past year of commit activity
    Python 3 1 0 0 Updated Sep 20, 2023