class documentation

A source pointing to a Parquet file

Class Method multi_source_features_for Undocumented
Method __hash__ Undocumented
Method all_between_dates Undocumented
Method all_data Undocumented
Async Method delete Undocumented
Async Method feature_view_code Set up the code needed to represent the data source as a feature view
Async Method insert Undocumented
Method job_group_key A key defining which sources can be grouped together in one request.
Async Method overwrite Undocumented
Async Method schema Returns the schema for the data source
Async Method to_lazy_polars Undocumented
Async Method to_pandas Undocumented
Async Method upsert Undocumented
Method with_view Undocumented
Async Method write_polars Undocumented
Class Variable config Undocumented
Class Variable date_formatter Undocumented
Class Variable directory Undocumented
Class Variable mapping_keys Undocumented
Class Variable partition_keys Undocumented
Class Variable type_name Undocumented
Property to_markdown Undocumented

Inherited from CodableBatchDataSource:

Class Method _deserialize Undocumented
Method _serialize Undocumented

Inherited from BatchDataSource (via CodableBatchDataSource):

Method all Undocumented
Method all_columns Undocumented
Method depends_on Undocumented
Method features_for Undocumented
Method filter Undocumented
Async Method freshness `.table("my_table").freshness()`
Method location_id Undocumented
Method source_id An id that identifies a source from others.
Method tags Undocumented
Method transform_with_polars Undocumented
Method with_loaded_at Undocumented

Inherited from ColumnFeatureMappable (via CodableBatchDataSource, BatchDataSource):

Method columns_for Undocumented
Method feature_identifier_for Undocumented
Method with_renames Undocumented

Inherited from DataFileReference (via CodableBatchDataSource, BatchDataSource, ColumnFeatureMappable):

Async Method read_pandas Undocumented
Async Method to_polars Undocumented
Async Method write_pandas Undocumented
@classmethod
def multi_source_features_for(cls, facts: RetrivalJob, requests: list[tuple[ParquetFileSource, RetrivalRequest]]) -> RetrivalJob:
def all_between_dates(self, request: RetrivalRequest, start_date: datetime, end_date: datetime) -> RetrivalJob:
def all_data(self, request: RetrivalRequest, limit: int | None) -> RetrivalJob:
async def delete(self):
async def feature_view_code(self, view_name: str) -> str:

Set up the code needed to represent the data source as a feature view.

```python
await FileSource.parquet("my_path.parquet").feature_view_code(view_name="my_view")

>>> """from aligned import FeatureView, String, Int64, Float

class MyView(FeatureView):

    metadata = FeatureView.metadata_with(
        name="Embarked",
        description="some description",
        batch_source=FileSource.parquet("my_path.parquet"),
        stream_source=None,
    )

    Passenger_id = Int64()
    Survived = Int64()
    Pclass = Int64()
    Name = String()
    Sex = String()
    Age = Float()
    Sibsp = Int64()
    Parch = Int64()
    Ticket = String()
    Fare = Float()
    Cabin = String()
    Embarked = String()"""
```

Returns:
str: The code needed to set up a basic feature view
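
To make the shape of the returned string concrete, here is a minimal sketch of how text like the example above could be rendered from a column-to-type mapping. It is not the library's implementation: `render_feature_view` and its parameters are hypothetical names introduced for illustration, and the column types are passed in as plain strings rather than read from a Parquet file.

```python
def render_feature_view(view_name: str, source_code: str, columns: dict[str, str]) -> str:
    """Hypothetical helper (not part of aligned): render feature view source text."""
    # Turn a snake_case view name such as "my_view" into a class name such as "MyView".
    class_name = "".join(part.capitalize() for part in view_name.split("_"))
    # Import each feature type once, in a stable order.
    type_names = sorted(set(columns.values()))
    lines = [
        f"from aligned import FeatureView, FileSource, {', '.join(type_names)}",
        "",
        f"class {class_name}(FeatureView):",
        "",
        "    metadata = FeatureView.metadata_with(",
        f'        name="{view_name}",',
        '        description="some description",',
        f"        batch_source={source_code},",
        "        stream_source=None,",
        "    )",
        "",
    ]
    # One class attribute per column, in the order the columns were given.
    lines += [f"    {column} = {type_name}()" for column, type_name in columns.items()]
    return "\n".join(lines)


code = render_feature_view(
    "my_view",
    'FileSource.parquet("my_path.parquet")',
    {"passenger_id": "Int64", "name": "String", "fare": "Float"},
)
print(code)
```

The sketch only builds text; nothing from `aligned` is imported or executed, so the result still needs to be reviewed before it is pasted into a project.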
async def insert(self, job: RetrivalJob, request: RetrivalRequest):
def job_group_key(self) -> str:

A key defining which sources can be grouped together in one request.
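
As an illustration of how such a key might be used, the sketch below buckets sources that share a key so that each bucket could be served by one request. `FakeSource` and `group_by_job_key` are hypothetical names for this example, and the key format (one key per directory) is an assumption, not the format the library produces.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class FakeSource:
    """Hypothetical stand-in for a data source; not an aligned class."""

    directory: str
    name: str

    def job_group_key(self) -> str:
        # Assumption: sources under the same directory can share one request.
        return f"parquet:{self.directory}"


def group_by_job_key(sources: list[FakeSource]) -> dict[str, list[FakeSource]]:
    """Bucket sources by their group key, keeping the input order within each bucket."""
    groups: dict[str, list[FakeSource]] = defaultdict(list)
    for source in sources:
        groups[source.job_group_key()].append(source)
    return dict(groups)


sources = [
    FakeSource("data/titanic", "passengers"),
    FakeSource("data/titanic", "tickets"),
    FakeSource("data/other", "ports"),
]
grouped = group_by_job_key(sources)
```

Here the two sources under `data/titanic` end up in the same bucket, while the source under `data/other` gets a bucket of its own.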

async def overwrite(self, job: RetrivalJob, request: RetrivalRequest):
async def schema(self) -> dict[str, FeatureType]:

Returns the schema for the data source

```python
source = FileSource.parquet_at('test_data/titanic.parquet')
schema = await source.schema()
>>> {'passenger_id': FeatureType(name='int64'), ...}
```

Returns:
dict[str, FeatureType]: A dictionary containing the column name and the feature type
async def to_lazy_polars(self) -> pl.LazyFrame:
async def to_pandas(self) -> pd.DataFrame:
async def upsert(self, job: RetrivalJob, request: RetrivalRequest):
def with_view(self, view: CompiledFeatureView) -> PartitionedParquetFileSource:
async def write_polars(self, df: pl.LazyFrame):

Undocumented

date_formatter: DateFormatter = (source)

Undocumented

directory: str = (source)

Undocumented

partition_keys: list[str] = (source)

Undocumented

@property
to_markdown: str = (source)

Undocumented