Comments (4)
Thanks for bringing this up @dominikabasaj. This is definitely on the radar and we will be adding support for Streaming. I will encourage you to wear a PM hat and help us define the requirements/use cases/etc around this feature. This will help us validate what we are thinking and makes sure you get what you are looking for in this feature. CC: @itsvikramagr
from sparklens.
Here is one way to get it working with streaming job. I haven't tried it with streaming yet. Let me know if this serves your purpose.
1.Start your application with --packages qubole:sparklens:0.1.2-s_2.11
but don't specify the extraListener config.
2. As part of your application, do the following:
import com.qubole.sparklens.QuboleNotebookListener
val QNL = new QuboleNotebookListener(sc.getConf)
sc.addSparkListener(QNL)
Basically, create a listener(note that this is Notebook listener and not JobListener) and register it.
3. within your streaming function (whatever is repeatedly called), wrap your code in the following:
QNL.profileIt {
//Your code here
}
Alternatively, if you need more control:
if (QNL.estimateSize() > QNL.getMaxDataSize()) {
QNL.purgeJobsAndStages()
}
val startTime = System.currentTimeInMillis
<-- Your scala code here -->
endTime = System.currentTimeInMillis
//wait for some time to get all events to accumulate
Thread.sleep(QNL.getWaiTimeInSeconds())
println(QNL.getStats(startTime, endTime))
- Checkout https://github.com/qubole/sparklens/blob/master/src/main/scala/com/qubole/sparklens/QuboleNotebookListener.scala for more information.
thanks!
from sparklens.
Sorry for duplicating, but this issue is also related to streaming, so just thought of updating.
We have tried using QuboleJobListener for structured streaming , but it will only provide reports after terminating the streaming query and also it provides for all the Jobs together (not batch wise)
But in general, as these Structured streaming applications are continuously running, users/developers will be interested to see stats for every few batches.
Detailed proposal is attached as below. Please review and provide your inputs.
Structured_streaming_sparklens.pdf
from sparklens.
@dominikabasaj @akumarb2010
You can check out our new project Streaminglens if you plan to use Sparklens for Streaming applications.
from sparklens.
Related Issues (20)
- Not able to run spark lens on spark history file HOT 1
- Release for Scala 2.12 HOT 4
- support in spark 3.x version HOT 1
- The qubole#sparklens;0.3.2-s_2.11 module is intermittently not found in the SparkPackages repo HOT 5
- sparkles.qubole.com gets timed out and does not open. Not able to upload sparklers JSON file
- Emailing report feature Not Working - Unresponsive post
- resolver-fix
- Not able to see the sparklens.Json File at mentioned Location HOT 1
- pyspark can use sparklens? HOT 1
- Implementation Of StreamingLens Without Changing in Existing Code. HOT 1
- Implementation Of StreamingLens in Existing Spark Streaming Applications
- JAR version Issue while Implementing StreamingLense HOT 1
- Not Able to See StreamingLens Report In Logs.
- analysize spark eventhistory , but per stage metrics, max task mem is all 0.0KB HOT 1
- Error while opening PySpark shell with the package and conf on my local
- Report mail not working HOT 1
- Mismatch in Driverclock time in notbook function and report via sparkjson file
- py4j.protocol.Py4JJavaError: An error occurred while calling None.org.apache.spark.api.java.JavaSparkContext. : java.lang.NoClassDefFoundError: scala/Product$class
- Getting this error in my jupyter notebook Py4JJavaError
- Sparklens UI Not working HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sparklens.