Technology
Spark
Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.
Spark handles massive workloads by processing data in-memory, delivering speeds up to 100x faster than traditional MapReduce. It supports SQL queries, streaming data, and complex analytics through libraries like MLlib and GraphX. Engineers use it to manage petabytes of data across thousands of nodes (nodes are the physical or virtual machines in a cluster). With native support for Python, Scala, Java, and R, it remains the gold standard for unified batch and real-time processing.
5 projects
·
9 cities
Related technologies
Recent Talks & Demos
Showing 1-5 of 5
NVIDIA Grace-Blackwell: Local AI Supercomputing
Paris
Mar 17
Grace-Blackwell
DGX Spark
OpenCode Local Models on DGX
Seattle
Jan 30
OpenCode
NVIDIA DGX Spark
LLM Agents Debate Bitcoin Fraud
Toronto
Jan 29
Google
Gemini
Archingeo: AI Safety in Infrastructure
Orange County
Jan 14
NVIDIA Jetson
PyTorch
Glimpse Insights Demo
Los Angeles
Aug 17
Glimpse Insights
GPT-4