It looks like you're offline.
Open Library logo
additional options menu
Open Library is running in limited-availability mode: login is disabled and some books may appear unavailable

MARC Record from marc_columbia

Record ID marc_columbia/Columbia-extract-20221130-032.mrc:121382088:4222
Source marc_columbia
Download Link /show-records/marc_columbia/Columbia-extract-20221130-032.mrc:121382088:4222?format=raw

LEADER: 04222cam a22003973i 4500
001 15765308
005 20221005125612.0
006 m o d
007 cr |n||||a||||
008 211105s2021 nyu|||| om 00| ||eng d
035 $a(OCoLC)1286685628
035 $a(OCoLC)on1286685628
035 $a(NNC)ACfeed:legacy_id:ac:q83bk3j9qz
035 $a(NNC)ACfeed:doi:10.7916/d8-h1rx-3n18
035 $a(NNC)15765308
040 $aNNC$beng$erda$cNNC
100 1 $aCorrea, Juan David.
245 12 $aA Computational Perspective of Causal Inference and the Data Fusion Problem /$cJuan David Correa.
264 1 $a[New York, N.Y.?] :$b[publisher not identified],$c2021.
336 $atext$btxt$2rdacontent
337 $acomputer$bc$2rdamedia
338 $aonline resource$bcr$2rdacarrier
300 $a1 online resource.
502 $aThesis (Ph.D.)--Columbia University, 2021.
500 $aDepartment: Computer Science.
500 $aThesis advisor: Elias Bareinboim.
520 $aThe ability to process and reason with causal information is fundamental in many aspects of human cognition and is pervasive in the way we probe reality in many of the empirical sciences. Given the centrality of causality through many aspects of human experience, we expect that the next generation of AI systems will need to represent causal knowledge, combine heterogeneous and biased datasets, and generalize across changing conditions and disparate domains to attain human-like intelligence. This dissertation investigates a problem in causal inference known as Data Fusion, which is concerned with inferring causal and statistical relationships from a combination of heterogeneous data collections from different domains, with various experimental conditions, and with nonrandom sampling (sampling selection bias). Despite the general conditions and algorithms developed so far for many aspects of the fusion problem, there are still significant aspects that are not well-understood and have not been studied together, as they appear in many challenging real-world applications.
520 $aSpecifically, this work advances our understanding of several dimensions of data fusion problems, which include the following capabilities and research questions: Reasoning with Soft Interventions. How to identify the effect of conditional and stochastic policies in a complex data fusion setting? Specifically, under what conditions can the effect of a new stochastic policy be evaluated using data from disparate sources and collected under different experimental conditions? Deciding Statistical Transportability. Under what conditions can statistical relationships (e.g., conditional distributions, classifiers) be extrapolated across disparate domains, where the target is somewhat related but not the same as the source domain where the data was initially collected? How to leverage additional data over a few variables in the target domain to help with the generalization process? Recovering from Selection Bias. How to determine whether a sample that was preferentially selected can be recovered so as to make a claim about the general underlying super-population? How can additional data over a subset of the variables, but sampled randomly, be used to achieve this goal? Instead of developing conditions and algorithms for each problem independently, this thesis introduces a computational framework capable of solving those research problems when appearing together.
520 $aThe approach decomposes the query and available heterogeneous distributions into factors with a canonical form. Then, the inference process is reduced to mapping the required factors to those available from the data, and then evaluating the query as a function of the input based on the mapping. The problems and methods discussed have several applications in the empirical sciences, statistics, machine learning, and artificial intelligence.
653 0 $aArtificial intelligence
653 0 $aComputer science
653 0 $aStochastic control theory--Statistical methods
653 0 $aStatistics
653 0 $aMachine learning
856 40 $uhttps://doi.org/10.7916/d8-h1rx-3n18$zClick for full text
852 8 $blweb$hDISSERTATIONS