-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Metric format #59
Metric format #59
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If speed is a concern, I would get rid of the BaseMetric
and register just simple functions. Creating classes has some overhead (not huge, but some), and having them here adds nothing.
If you want to make sure that the metric functions really accept the right arguments, you could use the inspect module. This runs when registering the function, not when running it, so it has no overhead at runtime.
@dalonsoa OK thanks - I've now implemented I think this is a cleaner solution, hopefully you agree! Thanks for the tips |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks excellent 👍
synthetic_G: nx.Graph, | ||
real_G: nx.Graph, | ||
**kwargs) -> float: | ||
"""Run the evaluated metric.""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@cheginit it seems like we're happy with the format of metrics, but in the interest of not starting out with dud metrics, can I just check that comparing the distribution of nx.betweenness_centrality
of two graphs via a KS test is actually a semi reasonable thing to do?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's a loaded question 😄
First, regarding computing BC, networkx
can be very slow (computing BC is computationally expensive in general), that's why I use networkit
. Second, for comparing graphs in the context of optimization, there are more suitable metrics that we can choose from. For example, there is an interesting discussion here. You can also check out the distance measures or s-metric in networkx
. It appears that there's a new backend for networkx
that speeds up some slow operations in networkx
, called graphblas
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK that's super helpful - though I'm going to bring it over to #50 since that's probably the best place to discuss loaded questions about graph comparisons ;)
Other than the comments, it looks good to me. |
Fixes #54
Begins #49
OK this is a reasonably small PR, but quite an important one.
@dalonsoa for your context - we will run the model lots of times with different parameter values - and the thing that we will study in the paper (and something I hope many other potential users are going to be interested in using our package for) are various metrics. These metrics will in some way capture how good a job the synthetic network has done versus the real network. There are a few different 'categories' of metric (see #49 ) - e.g., whether comparing the 'shape' of the network, or flows at a critical location - etc. From a compute perspective, the metrics are essentially 'free information' about the method since, in comparison to the network derivation and simulation, they take basically no time to calculate (with the exception of some complicated graph metrics, we can discuss whether these are necessary in #50 ).
It would be great to get a review from both @dalonsoa and @cheginit on this one since ideally we will implement many many metrics, so I want to make sure they are done in a thought through format before commencing. In particular here one thing I'm not so sure on is the need for a
BaseMetric
- I was copying fromgraphfcn
so I included it. But there is no parameter that the metric has to take (unlikegraphfcn
, which needsG
), and instead a subset of 6 arguments that the metric might take (see below).This PR has two example metrics (
bias_flood_depth
andkstest_betweenness
) to demonstrate.As with
graphfcn
I have put here an example to implement metrics using a register. We would iterate overmetrics
(provided in some configuration file) and pass a few arguments, something like this: