softsys4ai / unicorn Goto Github PK
View Code? Open in Web Editor NEWA Framework for Reasoning about System Performance using Causal AI
License: MIT License
A Framework for Reasoning about System Performance using Causal AI
License: MIT License
Need to determine the transfer learning pipeline. Determine the following:
--- How good is the source modeling?
--- How much update is needed?
--- Explainability (what are the changes across environments)
--- Experiments with different source budgets
-- Use FCI with the entropic approach to resolving edges.
-- Breakdown computation efforts required for causal structure discovery, computing path causal effects, computing individual treatment effect, and measuring recommended configurations.
Use SparseYolov2 and SparseYolov3. Similar to previous actions. I Will update you with details.
Update ground truth for each fault by using the configurations that provide 80% or more gain and recompute accuracy, precision, and recall with a confidence interval.
Bubble/un-directed edge - selection variables
Bi-directed edge - hidden variables
@iqbal128855 List some previous papers that use these systems:
--- Performance analysis of the Facebook DLRM systems with different configurations. Show how difficult it is to debug for misconfigurations in real-world production systems and discuss challenges. Discuss the richness in performance landscape (more complex behavior).
--- Run CAUPER, BugDoc, SMAC, DeltaDebugging, Encore, and CBI on the DLRM fault dataset and evaluate using the ground truth dataset for both single and multi-objective performance faults.
--- Show proof of scalability of CAUPER in Facebook DLRM system with a high number of allowable values taken by different configuration options.
--- Write about the evaluation of Facebook DLRM systems. Analyze by 3 slices of latency, energy and heat.
Use Fig. 3 from here: https://www.bdti.com/InsideDSP/2017/03/14/NVIDIA to explain a real world scenario https://forums.developer.nvidia.com/t/cuda-performance-issue-on-tx2/50477 to show it works
Enrich the causal models with Functional Causal Model (FCM) using CGNN and work with visualization for FCM
Update causal model with Causal Interaction model and compare with CGNN.
Comparison of CGNN, FCI (entropic calculation), and Causal Interaction model.
If we use CGNN need to find the correct strategy -
--- how to find the initial skeleton?
Dear experts of Unicorn,
Thanks a lot for open-sourcing this excellent research. I have read your EuroSys paper and learned a lot! I just have two questions regarding the codebase due to my lack of knowledge:
# replace trail and undirected edges with single edges using entropic policy for i in range (len(PAG)): if trail_edge in PAG[i]: PAG[i]=PAG[i].replace(trail_edge, directed_edge) elif undirected_edge in PAG[i]: PAG[i]=PAG[i].replace(undirected_edge, directed_edge) else: continue for edge in PAG: cur = edge.split(" ") if cur[1]==directed_edge: node_one = self.colmap[int(cur[0].replace("X", ""))-1] node_two = self.colmap[int(cur[2].replace("X", ""))-1] options[node_one][directed_edge].append(node_two) elif cur[1]==bi_edge: node_one = self.colmap[int(cur[0].replace("X", ""))-1] node_two = self.colmap[int(cur[2].replace("X", ""))-1] options[node_one][bi_edge].append(node_two) else: print ("[ERROR]: unexpected edges")
I did find a function that computed the entropy for the EnCore method in the debugging_based.py. But maybe it is not the same. Could you point me to the right location of the entropy-based method that orients the undetermined edges of FCI? Thanks in advance!
Best regards,
Tianzhu
Method | Where? | When | link |
---|---|---|---|
∆LDA | ECML | 2007 | http://pages.cs.wisc.edu/~jerryzhu/ssl/pub/rlda.pdf |
SmartConf | ASPLOS | 2018 | https://people.cs.uchicago.edu/~hankhoffmann/autoconf.pdf |
BestConfig | SoCC | 2017 | https://arxiv.org/pdf/1710.03439.pdf |
LEO | SIGARCH | 2015 | https://dl.acm.org/doi/pdf/10.1145/2786763.2694373 |
Run MLPerf Benchmark with Facebook DLRM on different hardware (Jetson Xavier and TX2, Possibly on GPU cloud). Change software (RMC1, RMC2, and RMC3) and change workload (single stream, multi-stream and offline, varying number of queries for inference.)
Run MLPerf Benchmark with Google BERT + SQuAD 1.1 dataset on different hardware (Jetson Xavier and TX2, Possibly on GPU cloud)
Run MLPerf Benchmark with ResNet 50 + ImageNet on different hardware (Jetson Xavier and TX2, Possibly on GPU cloud).
Run MLPerf Benchmark with Facebook DLRM on different hardware (Jetson Xavier and TX2, Possibly on GPU cloud). Change software (RMC1, RMC2, and RMC3) and change workload (single stream, multi-stream and offline, varying number of queries for inference.)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.