Learning general sparse additive models from point queries in high dimensions
Abstract
We consider the problem of learning a d-variate function f defined on the cube [-1,1]d⊂ Rd, where the algorithm is assumed to have black box access to samples of f within this domain. Denote Sr ⊂ [d] r; r=1,…,r0 to be sets consisting of unknown r-wise interactions amongst the coordinate variables. We then focus on the setting where f has an additive structure, i.e., it can be represented as f = Σ j ∈ S1 ϕ j + Σ j ∈ S2 ϕ j + … + Σ j ∈ Sr0 ϕ j, where each ϕ j; j ∈ Sr is at most r-variate for 1 ≤ r ≤ r0. We derive randomized algorithms that query f at carefully constructed set of points, and exactly recover each Sr with high probability. In contrary to the previous work, our analysis does not rely on numerical approximation of derivatives by finite order differences.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.