Product guide · 6 min read

ML feature CSVs: eyeballing training exports before notebooks run

Spot constant columns, label leakage, and impossible ranges in flat files before the data reaches sklearn or PyTorch.

Published March 21, 2025 · Table

A human scan complements automated profiling. Before training, sort numeric features so the extremes surface at the top and bottom, search for sentinel strings such as "unknown", and verify that the label column has the cardinality you expect.
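The same checks can be roughed out in a few lines of standard-library Python as a companion to the manual pass. This is a minimal sketch, not a profiling tool: the function name `scan_csv`, the `SENTINELS` set, and the sample columns (`age`, `country`, `source`, `label`) are all assumptions made for illustration.

```python
import csv
import io
import math
from collections import Counter

# Hypothetical sentinel strings; adjust to your export's conventions.
SENTINELS = {"unknown", "n/a", "null", "none", ""}

def scan_csv(text, label_col):
    """Report constant columns, sentinel hits, numeric ranges, label counts."""
    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)
    report = {"constant": [], "sentinels": {}, "ranges": {}}
    for col in reader.fieldnames or []:
        values = [(r[col] or "").strip() for r in rows]
        # A column with a single distinct value carries no signal.
        if rows and len(set(values)) == 1:
            report["constant"].append(col)
        # Count placeholder strings hiding in the column.
        hits = sum(v.lower() in SENTINELS for v in values)
        if hits:
            report["sentinels"][col] = hits
        # Collect values that parse as finite numbers and keep min/max.
        nums = []
        for v in values:
            try:
                x = float(v)
            except ValueError:
                continue
            if math.isfinite(x):
                nums.append(x)
        if nums:
            report["ranges"][col] = (min(nums), max(nums))
    report["label_counts"] = Counter(r[label_col] for r in rows)
    return report

# Made-up sample export, for illustration only.
SAMPLE = """age,country,source,label
34,TR,export_v2,1
-5,unknown,export_v2,0
212,DE,export_v2,1
41,unknown,export_v2,0
"""

report = scan_csv(SAMPLE, "label")
print(report["constant"])      # ['source']
print(report["sentinels"])     # {'country': 2}
print(report["ranges"]["age"]) # (-5.0, 212.0)
print(report["label_counts"])
```

In this sample the `age` range of -5 to 212 is the kind of impossible range that sorting the column by hand would also surface. The script only narrows down where to look; deciding whether a value is actually wrong still takes a person.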

Red flags

  • Future-dated columns that appear alongside the target (leakage).
  • IDs whose sort order tracks the labels perfectly (merge bugs).
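Both red flags can be checked mechanically once you know which columns to compare. The sketch below assumes ISO-formatted dates, integer IDs and labels, and a per-row cutoff column; the helper names and the columns `snapshot_date` and `last_contact_date` are hypothetical, chosen only to illustrate the idea.

```python
import csv
import io
from datetime import date

def future_dated_rows(rows, date_col, cutoff_col):
    """Indices of rows whose date_col falls after that row's cutoff date."""
    return [
        i for i, r in enumerate(rows)
        if date.fromisoformat(r[date_col]) > date.fromisoformat(r[cutoff_col])
    ]

def ids_sort_with_labels(rows, id_col, label_col):
    """True if ordering rows by ID leaves the labels fully sorted."""
    ordered = sorted(rows, key=lambda r: int(r[id_col]))
    labels = [int(r[label_col]) for r in ordered]
    if len(set(labels)) < 2:
        return False  # a single class cannot show an ordering pattern
    return labels == sorted(labels) or labels == sorted(labels, reverse=True)

# Made-up sample export, for illustration only.
SAMPLE = """id,snapshot_date,last_contact_date,label
1,2025-01-31,2025-01-12,0
2,2025-01-31,2025-01-20,0
3,2025-01-31,2025-02-14,1
4,2025-01-31,2025-03-02,1
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
leaky = future_dated_rows(rows, "last_contact_date", "snapshot_date")
suspicious = ids_sort_with_labels(rows, "id", "label")
print(leaky)       # [2, 3]
print(suspicious)  # True
```

A `True` from the second check is a prompt to inspect the merge or export step, not proof of a bug: small files can sort cleanly by chance.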
