remove dependency to ip2asn #31

romain-fontugne · 2024-12-20T09:41:35Z

This code is currently tricky to deploy because of its dependency on ip2asn. One way to fix this is to use IYP instead.

Steps:

remove code related to ip2asn
add code to query IYP to get prefix to ASN and prefix to IXP mappings

TejasNangru · 2024-12-23T03:21:44Z

@romain-fontugne,
I want to try this issue.

TejasNangru · 2024-12-24T03:44:44Z

@romain-fontugne
While setting up the code locally and installing the dependencies, I got this error for 'py-radix':
"ERROR: Failed building wheel for py-radix
Failed to build py-radix
ERROR: ERROR: Failed to build installable wheels for some pyproject.toml based projects (py-radix)"

romain-fontugne · 2024-12-24T05:46:45Z

Sure, you can work on this. I can help you query IYP if you need it.

The error you have is due to a recent problem with py-radix. You can try to install it from the source, the git repo has a fix:
https://github.com/mjschultz/py-radix

TejasNangru · 2024-12-24T08:16:14Z

Yes, I installed py-radix from the link you provided. But when I then run the command python3 setup.py build_ext --inplace, I get the error below, which says a C++ build tool is required. Are there any precompiled binaries for this?

"running build_ext
Compiling raclette/tracksaggregator_cy.pyx because it changed.
[1/1] Cythonizing raclette/tracksaggregator_cy.pyx
building 'raclette.tracksaggregator_cy' extension
error: Microsoft Visual C++ 14.0 or greater is required. Get it with "Microsoft C++ Build Tools": https://visualstudio.microsoft.com/visual-cpp-build-tools/"

romain-fontugne · 2024-12-24T08:24:06Z

There is a Cython module that you have to compile with a C++ compiler, so yes, you need that compiler.

TejasNangru · 2024-12-24T13:31:54Z

While testing the project, I get this:
"AssertionError: chunk_size should be smaller than window_size"
Should I change the value of chunk_size, or would that cause issues later on?

romain-fontugne · 2024-12-24T13:41:20Z

Ah yes, sorry. We should update the old configuration files (especially the ones mentioned in the README).

In production we use chunk_size = 300

TejasNangru · 2024-12-25T06:03:46Z

After setting the chunk_size value, I get this error:
" python raclette/raclette.py -C conf/asc-start.conf
type error: cannot pickle 'apsw.Connection' object
Traceback (most recent call last):
File "C:\Users\admin\OneDrive\Desktop\A\raclette\raclette\raclette.py", line 159, in main
saver.start()
File "C:\Users\admin\AppData\Local\Programs\Python\Python311\Lib\multiprocessing\process.py", line 121, in start
self._popen = self._Popen(self)
^^^^^^^^^^^^^^^^^
File "C:\Users\admin\AppData\Local\Programs\Python\Python311\Lib\multiprocessing\context.py", line 224, in _Popen
return _default_context.get_context().Process._Popen(process_obj)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\admin\AppData\Local\Programs\Python\Python311\Lib\multiprocessing\context.py", line 336, in _Popen
return Popen(process_obj)
^^^^^^^^^^^^^^^^^^
File "C:\Users\admin\AppData\Local\Programs\Python\Python311\Lib\multiprocessing\popen_spawn_win32.py", line 94, in __init__
reduction.dump(process_obj, to_child)
File "C:\Users\admin\AppData\Local\Programs\Python\Python311\Lib\multiprocessing\reduction.py", line 60, in dump
ForkingPickler(file, protocol).dump(obj)
TypeError: cannot pickle 'apsw.Connection' object

Traceback (most recent call last):
File "<string>", line 1, in <module>
File "C:\Users\admin\AppData\Local\Programs\Python\Python311\Lib\multiprocessing\spawn.py", line 111, in spawn_main
new_handle = reduction.duplicate(pipe_handle,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\admin\AppData\Local\Programs\Python\Python311\Lib\multiprocessing\reduction.py", line 79, in duplicate
return _winapi.DuplicateHandle(
^^^^^^^^^^^^^^^^^^^^^^^^
OSError: [WinError 6] The handle is invalid"

TejasNangru · 2024-12-25T06:04:50Z

I guess there is a problem in the code in sqlitesaver.py.

TejasNangru · 2024-12-27T03:26:12Z

@romain-fontugne

dpgiakatos · 2024-12-29T06:29:28Z

Hi, please try running the project on Linux instead of Windows, as the file paths are configured for a Linux environment.
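A further platform difference probably explains the pickling error in the traceback above. On Windows, multiprocessing starts child processes with the "spawn" method, which pickles the process object before sending it to the child (the reduction.dump call in the traceback). On Linux the default has traditionally been "fork", which needs no pickling. The sketch below only illustrates that mechanism. It uses the standard-library sqlite3 module as a stand-in for apsw, and the state dictionaries are hypothetical, not taken from sqlitesaver.py.

```python
import pickle
import sqlite3


def can_pickle(obj):
    """Return True if obj can be pickled, as the 'spawn' start method requires."""
    try:
        pickle.dumps(obj)
        return True
    except TypeError:
        return False


# State holding an open connection, as a saver that connects in its
# constructor would carry (hypothetical layout).
eager_state = {"db_path": ":memory:", "conn": sqlite3.connect(":memory:")}

# State holding only the path; the worker would open the connection
# itself after the child process has started.
lazy_state = {"db_path": ":memory:", "conn": None}

print(can_pickle(eager_state))  # False: connection objects cannot be pickled
print(can_pickle(lazy_state))   # True: plain data pickles fine
```

If the saver opened its database connection inside its run() method rather than in its constructor, the object would likely also start under "spawn". That is an assumption about a possible fix, not something confirmed in this thread.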

TejasNangru · 2024-12-29T06:37:49Z

But I don't know how to use Linux. Is there any other way?

roopeshsn · 2024-12-29T07:05:30Z

Try WSL

dpgiakatos · 2024-12-30T05:34:41Z

Here are two courses you can explore. The first is from the Linux Foundation and consists of about 60 hours of course material, covering a wide range of topics related to Linux. The second course is from freeCodeCamp and is approximately 6 hours long.

Since you are not yet familiar with the Linux environment, I recommend starting with the second course. It will provide you with a solid overview of Linux, including how to run applications and manage files. After completing the second course, I think you will be able to run this project in a Linux environment, and if you find yourself interested in Linux, you can continue with the first course for a more in-depth understanding.

Here are the links to the courses:

First course: Introduction to Linux - Linux Foundation
Second course: Linux for Beginners - freeCodeCamp

TejasNangru · 2024-12-30T10:19:23Z

ok sir

TejasNangru · 2025-01-06T17:30:14Z

@dpgiakatos @romain-fontugne I am now using WSL as my Linux environment, but when running the code locally I get this error:
"type error: name 'KAFKA_HOST' is not defined
Traceback (most recent call last):
File "/mnt/c/Users/admin/OneDrive/Desktop/IHR/raclette/raclette/raclette.py", line 174, in main
i2a = ip2asn.ip2asn(self.ip2asn_db, self.ip2asn_ixp, self.ip2asn_kafka_topic, KAFKA_HOST)
^^^^^^^^^^
NameError: name 'KAFKA_HOST' is not defined"

Is this the reason the current code is tricky to deploy?
If not, could you please suggest a solution to this error?

TejasNangru · 2025-01-06T17:42:48Z

Could you please guide me on how to query IYP to get prefix to ASN and prefix to IXP mappings? I’m encountering some difficulty in understanding how to integrate it into the project as a replacement for ip2asn.

dpgiakatos · 2025-01-07T01:24:27Z

Hi, please install Kafka locally and then define the KAFKA_HOST.
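One way to define the value is to read it from the environment with a local fallback. The sketch below is only an illustration: the environment variable name and the helper resolve_kafka_host are hypothetical and do not come from the raclette code. The fallback localhost:9092 assumes a local broker listening on Kafka's default port.

```python
import os


def resolve_kafka_host(environ=os.environ, default="localhost:9092"):
    """Return the Kafka bootstrap address, preferring the environment.

    Both the variable name KAFKA_HOST and the default are assumptions
    made for this sketch.
    """
    return environ.get("KAFKA_HOST", default)


# Module-level name, so code that refers to KAFKA_HOST finds it defined.
KAFKA_HOST = resolve_kafka_host()
```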

To obtain the prefix-to-ASN and prefix-to-IXP mappings, we currently use a pickle file, as far as I can see in the source code. My understanding is that we aim to replace this logic with IYP queries. IYP refers to our Internet Yellow Pages project, and you can refer to the corresponding documentation for guidance on querying IYP.

Here are the Cypher queries to assist you:

Prefix to ASN:

MATCH (a:AS)-[:ORIGINATE]-(p:Prefix)
RETURN p.prefix, a.asn

Prefix to IXP:

MATCH (p:Prefix)-[:MANAGED_BY]-(i:IXP)
RETURN p.prefix, i.name
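As a rough sketch of how the two queries above could feed an IP lookup from Python, assuming the third-party neo4j driver is installed: fetch_rows, build_table and longest_match are hypothetical helper names, the aliases prefix and value are added here for convenience, and the connection URI is deliberately left as a parameter to be taken from the IYP documentation. The lookup is a plain dictionary-based longest-prefix match, not the radix tree that ip2asn currently loads.

```python
import ipaddress

# Cypher from the comment above, with aliases so every row has fixed keys.
PREFIX_TO_ASN = """
MATCH (a:AS)-[:ORIGINATE]-(p:Prefix)
RETURN p.prefix AS prefix, a.asn AS value
"""
PREFIX_TO_IXP = """
MATCH (p:Prefix)-[:MANAGED_BY]-(i:IXP)
RETURN p.prefix AS prefix, i.name AS value
"""


def fetch_rows(uri, query, auth=None):
    """Run a Cypher query and return the rows as dicts.

    Needs the third-party neo4j driver and network access.
    """
    from neo4j import GraphDatabase  # lazy import: the helpers below work without it

    with GraphDatabase.driver(uri, auth=auth) as driver:
        with driver.session() as session:
            return [record.data() for record in session.run(query)]


def build_table(rows):
    """Map each prefix to the set of values (ASNs or IXP names) seen for it."""
    table = {}
    for row in rows:
        net = ipaddress.ip_network(row["prefix"], strict=False)
        table.setdefault(net, set()).add(row["value"])
    return table


def longest_match(table, ip):
    """Return (prefix, values) for the most specific covering prefix, or None."""
    addr = ipaddress.ip_address(ip)
    for plen in range(addr.max_prefixlen, -1, -1):
        net = ipaddress.ip_network(f"{addr}/{plen}", strict=False)
        if net in table:
            return net, table[net]
    return None


# Made-up sample rows standing in for a query result.
sample = [
    {"prefix": "198.51.100.0/24", "value": 64496},
    {"prefix": "198.51.0.0/16", "value": 64497},
]
table = build_table(sample)
print(longest_match(table, "198.51.100.7"))
```

With a real connection, the table would be built from fetch_rows(uri, PREFIX_TO_ASN) instead of the sample rows. A prefix can map to several values, which is why the table stores sets.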

TejasNangru · 2025-01-07T06:27:56Z

If the dependency on ip2asn is causing the Kafka issue, can I work directly on querying IYP instead of setting up Kafka?

TejasNangru · 2025-01-07T08:04:49Z

Also, does querying IYP require a username and password?

dpgiakatos · 2025-01-07T09:01:18Z

The functionality of the code should not be changed. You need to locate the code within ip2asn that needs to be replaced with the IYP query. You will retrieve the prefix from Kafka, so you must use it.

It is important to understand how the code works before making any changes, as the functionality must remain the same. Therefore, first, try to run the code locally without any errors, and then work on understanding the data.

You do not need credentials to log in to the IYP; the username and password are empty.

P.S. For Kafka support you can tag the @InternetHealthReport/iij team.

TejasNangru · 2025-01-08T12:37:27Z

Sir, I have now installed and set up Kafka and it is working,
but now I am getting this:

python3 raclette/raclette.py -C conf/asc-start.conf
type error: Invalid data stream
Traceback (most recent call last):
File "/mnt/c/Users/admin/OneDrive/Desktop/IHR/raclette/raclette/raclette.py", line 175, in main
i2a = ip2asn.ip2asn(self.ip2asn_db, self.ip2asn_ixp, self.ip2asn_kafka_topic, KAFKA_HOST)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/admin/OneDrive/Desktop/IHR/raclette/raclette/lib/ip2asn.py", line 49, in __init__
self.rtree = pickle.load(bz2.BZ2File(db, "rb"))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/lib/python3.12/bz2.py", line 155, in peek
return self._buffer.peek(n)
^^^^^^^^^^^^^^^^^^^^
File "/usr/lib/python3.12/_compression.py", line 68, in readinto
data = self.read(len(byte_view))
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/lib/python3.12/_compression.py", line 103, in read
data = self._decompressor.decompress(rawblock, size)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
OSError: Invalid data stream

dpgiakatos · 2025-01-08T14:49:02Z

The error you are encountering is an OSError related to the bz2 module. This can happen if the bz2 module is not properly installed on your OS. The code below will help you verify whether the bz2 module is functioning correctly on your OS.

import pickle
import bz2

# Create a sample object and save it to a BZ2 file
data = {'key': 'value'}
with bz2.BZ2File('sample_data.bz2', 'wb') as f:
    pickle.dump(data, f)

# Read the BZ2 file
with bz2.BZ2File('sample_data.bz2', 'rb') as f:
    loaded_data = pickle.load(f)
    print(loaded_data)  # Should output: {'key': 'value'}
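If the round trip above works, another thing worth checking is whether the database file itself is a bz2 stream, since bz2 raises "OSError: Invalid data stream" when the input is not valid bz2 data. A minimal sketch, assuming only that bz2 files begin with the bytes "BZh"; the helper name looks_like_bz2 and the temporary file names are made up for illustration.

```python
import bz2
import os
import tempfile

BZ2_MAGIC = b"BZh"  # every bz2 stream starts with these three bytes


def looks_like_bz2(path):
    """Return True if the file starts with the bz2 magic bytes."""
    with open(path, "rb") as f:
        return f.read(len(BZ2_MAGIC)) == BZ2_MAGIC


# Small self-check with two temporary files.
tmp_dir = tempfile.mkdtemp()
good_path = os.path.join(tmp_dir, "good.bz2")
bad_path = os.path.join(tmp_dir, "bad.bz2")

with bz2.open(good_path, "wb") as f:
    f.write(b"example payload")
with open(bad_path, "wb") as f:
    f.write(b"plain text, not compressed")

print(looks_like_bz2(good_path))  # True
print(looks_like_bz2(bad_path))   # False
```

Running looks_like_bz2 on the file configured as ip2asn_db would show whether the file on disk is compressed data at all.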

TejasNangru · 2025-01-09T07:02:29Z

In the original codebase, this value is left undefined.

Because of this, I am getting this error:
type error: [Errno 2] No such file or directory: ''
Traceback (most recent call last):
File "/mnt/c/Users/admin/OneDrive/Desktop/IHR/raclette/raclette/raclette.py", line 175, in main
i2a = ip2asn.ip2asn(self.ip2asn_db, self.ip2asn_ixp, self.ip2asn_kafka_topic, KAFKA_HOST)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/admin/OneDrive/Desktop/IHR/raclette/raclette/lib/ip2asn.py", line 53, in __init__
with open(ixp) as fi:
^^^^^^^^^
FileNotFoundError: [Errno 2] No such file or directory: ''

dpgiakatos · 2025-01-10T01:06:32Z

@InternetHealthReport/iij can you help with the above variable?

romain-fontugne · 2025-01-10T02:27:44Z

In the data/ folder there is a file 'ixs_202310.jsonl'.

You can just put data/ixs_202310.jsonl for the ip2asn_ixp value.

romain-fontugne · 2025-01-10T02:32:48Z

Regarding the "OSError: Invalid data stream" error reported above: I'll push a quick fix for that. For the configuration file you are using, you shouldn't need Kafka.

romain-fontugne · 2025-01-10T02:47:26Z

It should be fixed now.
Actually, removing the Kafka part of ip2asn (lib/ip2asn.py) would be good, because we are not using it.

TejasNangru · 2025-02-06T11:44:58Z

Hi guys,
I am no longer working on this issue.
If any newcomers want to try it, feel free.

dpgiakatos mentioned this issue Jan 14, 2025

Navigation Links Not Landing on Target Section InternetHealthReport/ihr-website#885

Closed
