trinity.buffer
Subpackages
- trinity.buffer.reader
- trinity.buffer.schema
- trinity.buffer.writer
Submodules
trinity.buffer.buffer module
The buffer module
- trinity.buffer.buffer.get_buffer_reader(storage_config: StorageConfig, buffer_config: BufferConfig) BufferReader [source]
Get a buffer reader for the given dataset name.
- trinity.buffer.buffer.get_buffer_writer(storage_config: StorageConfig, buffer_config: BufferConfig) BufferWriter [source]
Get a buffer writer for the given dataset name.
trinity.buffer.buffer_reader module
Reader of the buffer.
- class trinity.buffer.buffer_reader.BufferReader[source]
Bases:
ABC
Interface of the buffer reader.
- abstract read(batch_size: int | None = None, strategy: ReadStrategy | None = None) List [source]
Read from buffer.
trinity.buffer.buffer_writer module
Writer of the buffer.
trinity.buffer.queue module
A queue implemented by Ray Actor.
- class trinity.buffer.queue.QueueActor(storage_config: StorageConfig, config: BufferConfig)[source]
Bases:
object
An asyncio.Queue based queue actor.
- FINISH_MESSAGE = '$FINISH$'
- __init__(storage_config: StorageConfig, config: BufferConfig) None [source]
- classmethod get_actor(storage_config: StorageConfig, config: BufferConfig)[source]
Get the queue actor.
trinity.buffer.ray_wrapper module
- class trinity.buffer.ray_wrapper.DBWrapper(storage_config: StorageConfig, config: BufferConfig)[source]
Bases:
object
A wrapper of a SQL database.
If wrap_in_ray in StorageConfig is True, this class will be run as a Ray Actor, and provide a remote interface to the local database.
For databases that do not support multi-processing read/write (e.g. sqlite, duckdb), we recommend setting wrap_in_ray to True
- __init__(storage_config: StorageConfig, config: BufferConfig) None [source]
- classmethod get_wrapper(storage_config: StorageConfig, config: BufferConfig)[source]
- read(batch_size: int | None = None, strategy: ReadStrategy | None = None) List [source]
- class trinity.buffer.ray_wrapper.FileWrapper(storage_config: StorageConfig, config: BufferConfig)[source]
Bases:
object
A wrapper of a local jsonl file.
If wrap_in_ray in StorageConfig is True, this class will be run as a Ray Actor, and provide a remote interface to the local file.
This wrapper is only for writing, if you want to read from the file, use StorageType.QUEUE instead.
- __init__(storage_config: StorageConfig, config: BufferConfig) None [source]
- classmethod get_wrapper(storage_config: StorageConfig, config: BufferConfig)[source]
trinity.buffer.utils module
- trinity.buffer.utils.retry_session(session_maker, max_retry_times: int, max_retry_interval: float)[source]
A Context manager for retrying session.
- trinity.buffer.utils.default_storage_path(storage_config: StorageConfig, buffer_config: BufferConfig) str [source]
Module contents
- trinity.buffer.get_buffer_reader(storage_config: StorageConfig, buffer_config: BufferConfig) BufferReader [source]
Get a buffer reader for the given dataset name.
- trinity.buffer.get_buffer_writer(storage_config: StorageConfig, buffer_config: BufferConfig) BufferWriter [source]
Get a buffer writer for the given dataset name.