... ingestion, validation, and storage for multi-terabyte to petabyte-scale AI ... Apache Spark, Ray, or Dask
Skilled in working with distributed data ... with them
Comfortable working with multi-terabyte or larger datasets
Hands ...
6 days ago