Input/Output Reference

This section provides a detailed API reference for all modules related to data input, output, and framework interoperability in the datarec library.

Core I/O Modules

These modules handle the fundamental tasks of reading, writing, and representing raw data.

`RawData`

Container for raw datasets in DataRec.

Wraps a pandas.DataFrame and stores metadata about user, item, rating, and timestamp columns. Provides lightweight methods for slicing, copying, and merging data.

Source code in datarec/io/rawdata.py

class RawData:
    """
    Container for raw datasets in DataRec.

    Wraps a `pandas.DataFrame` and stores metadata about user, item, rating, and timestamp columns.
    Provides lightweight methods for slicing, copying, and merging data.
    """
    def __init__(self, data=None, header=False, user=None, item=None, rating=None, timestamp=None):
        """
        Initialize a RawData object.

        Args:
            data (pd.DataFrame): DataFrame of the dataset. Defaults to None.
            header (bool): Whether the file has a header. Defaults to False.
            user (str): Column name for user IDs.
            item (str): Column name for item IDs.
            rating (str): Column name for ratings.
            timestamp (str): Column name for timestamps.
        """
        self.data = data
        self.header = header
        if data is None:
            self.data = pd.DataFrame
            self.header = header
        self.path = None

        self.user = user
        self.item = item
        self.rating = rating
        self.timestamp = timestamp

    def append(self, new_data):
        """
        Append new rows to the dataset.

        Args:
            new_data (pd.DataFrame): DataFrame to append.

        Returns:
            None
        """
        self.data.append(new_data)

    def copy(self, deep=True):
        """
        Make a copy of the dataset.

        Args:
            deep (bool): If True, return a deep copy of the dataset.

        Returns:
            (RawData): A copy of the dataset.

        """
        self.data.copy(deep=deep)

    def __repr__(self):
        """
        Return a string representation of the dataset.
        """
        return repr(self.data)

    def __len__(self):
        """
        Return the length of the dataset.
        """
        return len(self.data)

    def __getitem__(self, idx):
        """
        Return the item at the given index.
        Args:
            idx: index of the item to return.

        Returns:
            (RawData): the sample at the given index.

        """
        return self.data[idx]

    def __add__(self, other):
        """
        Concatenate two RawData objects.
        Args:
            other (RawData): the other RawData to concatenate.

        Returns:
            (RawData): the concatenated RawData object.

        """
        self.__check_rawdata_compatibility__(other)
        new_data = pd.concat([self.data, other.data])
        new_rawdata = RawData(new_data, user=self.user, item=self.item, rating=self.rating,
                              timestamp=self.timestamp, header=self.header)
        return new_rawdata

    def __iter__(self):
        """
        Iterate over dataset rows.

        Returns:
            (pd.Series): Each row in the dataset.

        """
        return iter(self.data)

    def __check_rawdata_compatibility__(self, rawdata):
        """
        Check compatibility between RawData objects.
        Args:
            rawdata (RawData): RawData object to check.

        Returns:
            (bool): True if compatibility is verified.

        """
        return __check_rawdata_compatibility__(self, rawdata)

`init(data=None, header=False, user=None, item=None, rating=None, timestamp=None)`

Initialize a RawData object.

Parameters:

Name	Type	Description	Default
`data`	`DataFrame`	DataFrame of the dataset. Defaults to None.	`None`
`header`	`bool`	Whether the file has a header. Defaults to False.	`False`
`user`	`str`	Column name for user IDs.	`None`
`item`	`str`	Column name for item IDs.	`None`
`rating`	`str`	Column name for ratings.	`None`
`timestamp`	`str`	Column name for timestamps.	`None`

Source code in datarec/io/rawdata.py

def __init__(self, data=None, header=False, user=None, item=None, rating=None, timestamp=None):
    """
    Initialize a RawData object.

    Args:
        data (pd.DataFrame): DataFrame of the dataset. Defaults to None.
        header (bool): Whether the file has a header. Defaults to False.
        user (str): Column name for user IDs.
        item (str): Column name for item IDs.
        rating (str): Column name for ratings.
        timestamp (str): Column name for timestamps.
    """
    self.data = data
    self.header = header
    if data is None:
        self.data = pd.DataFrame
        self.header = header
    self.path = None

    self.user = user
    self.item = item
    self.rating = rating
    self.timestamp = timestamp

`append(new_data)`

Append new rows to the dataset.

Parameters:

Name	Type	Description	Default
`new_data`	`DataFrame`	DataFrame to append.	required

Returns:

Type	Description
	None

Source code in datarec/io/rawdata.py

def append(self, new_data):
    """
    Append new rows to the dataset.

    Args:
        new_data (pd.DataFrame): DataFrame to append.

    Returns:
        None
    """
    self.data.append(new_data)

`copy(deep=True)`

Make a copy of the dataset.

Parameters:

Name	Type	Description	Default
`deep`	`bool`	If True, return a deep copy of the dataset.	`True`

Returns:

Type	Description
`RawData`	A copy of the dataset.

Source code in datarec/io/rawdata.py

def copy(self, deep=True):
    """
    Make a copy of the dataset.

    Args:
        deep (bool): If True, return a deep copy of the dataset.

    Returns:
        (RawData): A copy of the dataset.

    """
    self.data.copy(deep=deep)

`repr()`

Return a string representation of the dataset.

Source code in datarec/io/rawdata.py

def __repr__(self):
    """
    Return a string representation of the dataset.
    """
    return repr(self.data)

`len()`

Return the length of the dataset.

Source code in datarec/io/rawdata.py

def __len__(self):
    """
    Return the length of the dataset.
    """
    return len(self.data)

`getitem(idx)`

Return the item at the given index. Args: idx: index of the item to return.

Returns:

Type	Description
`RawData`	the sample at the given index.

Source code in datarec/io/rawdata.py

def __getitem__(self, idx):
    """
    Return the item at the given index.
    Args:
        idx: index of the item to return.

    Returns:
        (RawData): the sample at the given index.

    """
    return self.data[idx]

`add(other)`

Concatenate two RawData objects. Args: other (RawData): the other RawData to concatenate.

Returns:

Type	Description
`RawData`	the concatenated RawData object.

Source code in datarec/io/rawdata.py

def __add__(self, other):
    """
    Concatenate two RawData objects.
    Args:
        other (RawData): the other RawData to concatenate.

    Returns:
        (RawData): the concatenated RawData object.

    """
    self.__check_rawdata_compatibility__(other)
    new_data = pd.concat([self.data, other.data])
    new_rawdata = RawData(new_data, user=self.user, item=self.item, rating=self.rating,
                          timestamp=self.timestamp, header=self.header)
    return new_rawdata

`iter()`

Iterate over dataset rows.

Returns:

Type	Description
`Series`	Each row in the dataset.

Source code in datarec/io/rawdata.py

def __iter__(self):
    """
    Iterate over dataset rows.

    Returns:
        (pd.Series): Each row in the dataset.

    """
    return iter(self.data)

`__check_rawdata_compatibility__(rawdata)`

Check compatibility between RawData objects. Args: rawdata (RawData): RawData object to check.

Returns:

Type	Description
`bool`	True if compatibility is verified.

Source code in datarec/io/rawdata.py

def __check_rawdata_compatibility__(self, rawdata):
    """
    Check compatibility between RawData objects.
    Args:
        rawdata (RawData): RawData object to check.

    Returns:
        (bool): True if compatibility is verified.

    """
    return __check_rawdata_compatibility__(self, rawdata)

`__check_rawdata_compatibility__(rawdata1, rawdata2)`

Check compatibility between two RawData objects. Args: rawdata1 (RawData): First RawData object to check. rawdata2 (RawData): Second RawData object to check.

Returns:

Type	Description
`bool`	True if compatibility is verified.

Source code in datarec/io/rawdata.py

def __check_rawdata_compatibility__(rawdata1: RawData, rawdata2: RawData):
    """
    Check compatibility between two RawData objects.
    Args:
        rawdata1 (RawData): First RawData object to check.
        rawdata2 (RawData): Second RawData object to check.

    Returns:
        (bool): True if compatibility is verified.

    """
    if rawdata1.user != rawdata2.user:
        raise ValueError('User columns are not compatible')
    if rawdata1.item != rawdata2.item:
        raise ValueError('Item columns are not compatible')
    if rawdata1.rating != rawdata2.rating:
        raise ValueError('Rating columns are not compatible')
    if rawdata1.timestamp != rawdata2.timestamp:
        raise ValueError('Timestamp columns are not compatible')
    if rawdata1.header != rawdata2.header:
        raise ValueError('Header is not compatible')
    return True

`fill_rawdata(data, user=None, item=None, rating=None, timestamp=None, path=None)`

Create a RawData object from raw data and assign column names to RawData object attributes.

Parameters:

Name	Type	Description	Default
`data`	`DataFrame`	Data to create RawData object from.	required
`user`	`str`	Column name for user field.	`None`
`item`	`str`	Column name for item field.	`None`
`rating`	`str`	Column name for rating field.	`None`
`timestamp`	`str`	Column name for timestamp field.	`None`
`path`	`str`	Path where the original file is stored.	`None`

Source code in datarec/io/readers.py

def fill_rawdata(data, user=None, item=None, rating=None, timestamp=None, path=None):
    """
    Create a RawData object from raw data and assign column names to RawData object attributes.

    Args:
        data (pd.DataFrame): Data to create RawData object from.
        user (str): Column name for user field.
        item (str): Column name for item field.
        rating (str): Column name for rating field.
        timestamp (str): Column name for timestamp field.
        path (str): Path where the original file is stored.


    """
    rawdata = RawData(data)

    # set columns
    rawdata.user = user
    rawdata.item = item
    rawdata.rating = rating
    rawdata.timestamp = timestamp

    # set file path
    rawdata.path = path

`read_json(filepath, user_field=None, item_field=None, rating_field=None, timestamp_field=None, lines=True)`

Reads a JSON file and returns it as a RawData object. Args: filepath (str): path to JSON file. user_field (str): JSON key for user field. item_field (str): JSON key for item field. rating_field (str): JSON key for rating field. timestamp_field (str): JSON key for timestamp field. lines (bool): Read the file as a JSON object per line.

Returns:

Type	Description
`RawData`	RawData object

Source code in datarec/io/readers.py

def read_json(filepath, user_field=None, item_field=None, rating_field=None, timestamp_field=None, lines=True):
    """
    Reads a JSON file and returns it as a RawData object.
    Args:
        filepath (str): path to JSON file.
        user_field (str): JSON key for user field.
        item_field (str): JSON key for item field.
        rating_field (str): JSON key for rating field.
        timestamp_field (str): JSON key for timestamp field.
        lines (bool): Read the file as a JSON object per line.

    Returns:
        (RawData): RawData object

    """
    # check that file exists
    if os.path.exists(filepath) is False:
        raise FileNotFoundError

    std_fields = [user_field, item_field, rating_field, timestamp_field]
    assigned_fields = [c for c in std_fields if c is not None]

    # at least one column given check
    if len(assigned_fields) == 0:
        raise AttributeError('Fields are missing. At least one should be assigned')

    # read data
    data = pd.read_json(filepath, lines=lines)

    # check that columns are aligned
    for c in assigned_fields:
        if c not in data.columns:
            raise ValueError(f'Field {c} not found in the dataset. Please, check the value and retry')

    rawdata = RawData(data[assigned_fields])

    # set columns
    rawdata.user = user_field if user_field is not None else None
    rawdata.item = item_field if item_field is not None else None
    rawdata.rating = rating_field if rating_field is not None else None
    rawdata.timestamp = timestamp_field if timestamp_field is not None else None
    return rawdata

`read_tabular(filepath, sep, user_col=None, item_col=None, rating_col=None, timestamp_col=None, header='infer', skiprows=0)`

Reads a tabular data file and returns it as a pandas DataFrame. Args: filepath (str): Path to tabular data file. sep (str): Separator to use. user_col (str): Column name for user field. item_col (str): Column name for item field. rating_col (str): Column name for rating field. timestamp_col (str): Column name for timestamp field. header (nt, Sequence of int, ‘infer’ or None): Row number(s) containing column labels and marking the start of the data (zero-indexed). Default behavior is to infer the column names. skiprows (int, list of int or Callable): Line numbers to skip (0-indexed) or number of lines to skip (int) at the start of the file.

Returns:

Type	Description
`RawData`	RawData object.

Source code in datarec/io/readers.py

def read_tabular(filepath: str, sep: str, user_col=None, item_col=None, rating_col=None, timestamp_col=None,
                 header="infer", skiprows=0):
    """
    Reads a tabular data file and returns it as a pandas DataFrame.
    Args:
        filepath (str): Path to tabular data file.
        sep (str): Separator to use.
        user_col (str): Column name for user field.
        item_col (str): Column name for item field.
        rating_col (str): Column name for rating field.
        timestamp_col (str): Column name for timestamp field.
        header (nt, Sequence of int, ‘infer’ or None): Row number(s) containing column labels and marking the start of the data (zero-indexed). Default behavior is to infer the column names.
        skiprows (int, list of int or Callable): Line numbers to skip (0-indexed) or number of lines to skip (int) at the start of the file.

    Returns:
        (RawData): RawData object.

    """
    # check that file exists
    if os.path.exists(filepath) is False:
        raise FileNotFoundError

    std_columns = [user_col, item_col, rating_col, timestamp_col]
    assigned_columns = [c for c in std_columns if c is not None]

    # at least one column given check
    if len(assigned_columns) == 0:
        raise AttributeError('Columns are missing. At least one should be assigned')

    # read data
    data = pd.read_table(filepath_or_buffer=filepath, sep=sep, header=header, skiprows=skiprows, engine='python')

    # check that columns are aligned
    for c in assigned_columns:
        if c not in data.columns:
            raise ValueError(f'Column {c} not found in the dataset. Please, check the value and retry')

    rawdata = RawData(data=data[assigned_columns])

    # set columns
    rawdata.user = user_col if (user_col is not None) else None
    rawdata.item = item_col if item_col is not None else None
    rawdata.rating = rating_col if rating_col is not None else None
    rawdata.timestamp = timestamp_col if timestamp_col is not None else None

    return rawdata

`read_inline(filepath, cols=None, user_col='user', item_col='item', col_sep=',', history_sep=';')`

Read a CSV file and return a RawData object. Args: filepath (str): Path to CVS file. cols (list[str]): List of column names. user_col (str): Column name for user field.: item_col (str): Column name for item field. col_sep (str): Separator to use. history_sep (str): Separator for multiple items.

Returns:

Type	Description
`RawData`	RawData object.

Source code in datarec/io/readers.py

def read_inline(filepath: str, cols=None, user_col='user', item_col='item', col_sep=',', history_sep=';'):
    """
    Read a CSV file and return a RawData object.
    Args:
        filepath (str): Path to CVS file.
        cols (list[str]): List of column names.
        user_col (str): Column name for user field.:
        item_col (str): Column name for item field.
        col_sep (str): Separator to use.
        history_sep (str): Separator for multiple items.

    Returns:
        (RawData): RawData object.

    """
    if cols is None:
        cols = ['user', 'item']
    assert os.path.exists(filepath), f'File not found at {filepath}'
    to_drop_cols = [c for c in cols if c not in (user_col, item_col)]

    data = pd.read_csv(filepath, sep=col_sep, header=None, names=cols)
    data = data.dropna(subset=['user', 'item'])
    data = data.drop(columns=to_drop_cols)
    data[item_col] = data[item_col].apply(lambda x: [item.strip() for item in x.split(history_sep)])
    data = data.explode('item')
    data = data.reset_index(drop=True)
    return RawData(data, user='user', item='item')

`read_inline_chunk(filepath, cols=None, user_col='user', item_col='item')`

Read a CSV file a chunk of rows at a time and return a RawData object. Args: filepath (str): Path to CSV file. cols (list[str]): List of column names. user_col (str): Column name for user field. item_col (str): Column name for item field.

Returns:

Type	Description
`RawData`	RawData object.

Source code in datarec/io/readers.py

def read_inline_chunk(filepath: str, cols=None, user_col='user', item_col='item'):
    """
    Read a CSV file a chunk of rows at a time and return a RawData object.
    Args:
        filepath (str): Path to CSV file.
        cols (list[str]): List of column names.
        user_col (str): Column name for user field.
        item_col (str): Column name for item field.

    Returns:
        (RawData): RawData object.

    """
    if cols is None:
        cols = ['user', 'item']
    assert os.path.exists(filepath), f'File not found at {filepath}'
    to_drop_cols = [c for c in cols if c not in (user_col, item_col)]

    data_chunks = pd.read_csv(filepath, sep=',', header=None, names=cols, chunksize=100000)
    data = None

    for chunk in tqdm.tqdm(data_chunks):
        chunk = chunk.drop(columns=to_drop_cols)
        chunk[item_col] = chunk[item_col].apply(lambda x: [item.strip() for item in x.split(';')])
        chunk = chunk.explode('item')
        if data is not None:
            data = pd.concat([data, chunk])
        else:
            data = chunk

    data = data.reset_index(drop=True)
    return RawData(data, user='user', item='item')

`write_tabular(rawdata, path, sep='\t', header=True, decimal='.', user=True, item=True, rating=True, timestamp=True, verbose=True)`

Write a RawData dataset to a CSV/TSV file.

Parameters:

Name	Type	Description	Default
`rawdata`	`RawData`	RawData instance.	required
`path`	`str`	Path to the CSV/TSV file.	required
`sep`	`str`	Separator to use.	`'\t'`
`header`	`bool or list[str]`	Write out the column names. If a list of strings is given it is assumed to be aliases for the column names.	`True`
`decimal`	`str`	Character recognized as decimal separator.	`'.'`
`user`	`bool`	Whether to write the user information. If True, the user information will be written in the file.	`True`
`item`	`bool`	Whether to write the item information. If True, the item information will be written in the file.	`True`
`rating`	`bool`	Whether to write the rating information. If True, the rating information will be written in the file.	`True`
`timestamp`	`bool`	Whether to write the timestamp information. If True, the timestamp information will be written in the file.	`True`
`verbose`	`bool`	Print out additional information.	`True`

Returns:

Type	Description
	(CSV/TSV file)

Source code in datarec/io/writers.py

def write_tabular(rawdata: RawData, path, sep='\t', header=True, decimal='.',
                  user=True, item=True, rating=True, timestamp=True, verbose=True):
    """
    Write a RawData dataset to a CSV/TSV file.

    Args:
        rawdata (RawData): RawData instance.
        path (str): Path to the CSV/TSV file.
        sep (str): Separator to use.
        header (bool or list[str]): Write out the column names. If a list of strings is given it is assumed to be aliases for the column names.
        decimal (str): Character recognized as decimal separator.
        user (bool): Whether to write the user information. If True, the user information will be written in the file.
        item (bool): Whether to write the item information. If True, the item information will be written in the file.
        rating (bool): Whether to write the rating information. If True, the rating information will be written in the file.
        timestamp (bool): Whether to write the timestamp information. If True, the timestamp information will be written in the file.
        verbose (bool): Print out additional information.

    Returns:
        (CSV/TSV file)

    """
    cols = []
    if user:
        if rawdata.user:
            cols.append(rawdata.user)
        else:
            raise ValueError('User column not defined in the DataRec.')
    if item:
        if rawdata.item:
            cols.append(rawdata.item)
        else:
            raise ValueError('Item column not defined in the DataRec.')
    if rating:
        if rawdata.rating:
            cols.append(rawdata.rating)
        else:
            raise ValueError('Rating column not defined in the DataRec.')
    if timestamp:
        if rawdata.timestamp:
            cols.append(rawdata.timestamp)
        else:
            raise ValueError('Timestamp column not defined in the DataRec.')

    data: pd.DataFrame = rawdata.data[cols]

    if sep in ACCEPTED_TAB_DELIMITERS:
        if sep == "::":
            file = data.to_csv(sep='*', header=header, index=False, decimal=decimal)
            file.replace('*', '::')
            with open(file, 'w') as f:
                f.write(file)
        else:
            data.to_csv(path, sep=sep, header=header, index=False, decimal=decimal)
            if verbose:
                print(f'A dataset has been stored at \'{path}\'')
    else:
        raise ValueError

`write_json(rawdata, path, user=True, item=True, rating=True, timestamp=True)`

Write a RawData dataset to a JSON file. Args: rawdata (RawData): RawData instance. path (str): Path to the JSON file. user (bool): Whether to write the user information. If True, the user information will be written in the file. item (bool): Whether to write the item information. If True, the item information will be written in the file. rating (bool): Whether to write the rating information. If True, the rating information will be written in the file. timestamp (bool): Whether to write the timestamp information. If True, the timestamp information will be written in the file.

Returns:

Type	Description
	(JSON file)

Source code in datarec/io/writers.py

def write_json(rawdata: RawData, path, user=True, item=True, rating=True, timestamp=True):
    """
    Write a RawData dataset to a JSON file.
    Args:
        rawdata (RawData): RawData instance.
        path (str): Path to the JSON file.
        user (bool): Whether to write the user information. If True, the user information will be written in the file.
        item (bool): Whether to write the item information. If True, the item information will be written in the file.
        rating (bool): Whether to write the rating information. If True, the rating information will be written in the file.
        timestamp (bool): Whether to write the timestamp information. If True, the timestamp information will be written in the file.

    Returns:
        (JSON file)

    """

    cols = []
    if user:
        cols.append(rawdata.user)
    if item:
        cols.append(rawdata.item)
    if rating:
        cols.append(rawdata.rating)
    if timestamp:
        cols.append(rawdata.timestamp)

    data: pd.DataFrame = rawdata.data[cols]

    data.to_json(path, orient='records', lines=True)

`get_cache_dir(app_name='datarec', app_author='sisinflab')`

Returns the appropriate cache directory for the library, creating it if it doesn't exist. Respects the DATAREC_CACHE_DIR environment variable if set.

Returns:

Name	Type	Description
`Path`		The absolute path to the cache directory.

Source code in datarec/io/paths.py

def get_cache_dir(app_name="datarec", app_author="sisinflab"):
    """
    Returns the appropriate cache directory for the library, creating it if it doesn't exist.
    Respects the DATAREC_CACHE_DIR environment variable if set.

    Returns:
        Path: The absolute path to the cache directory.
    """
    env_override = os.getenv("DATAREC_CACHE_DIR")
    path = Path(env_override) if env_override else Path(user_cache_dir(app_name, app_author))
    path.mkdir(parents=True, exist_ok=True)
    return path

`dataset_directory(dataset_name, must_exist=False)`

Given the dataset name returns the dataset directory Args: dataset_name (str): name of the dataset must_exist (bool): flag for forcing to check if the folder exists

Returns:

Type	Description
`str`	the path of the directory containing the dataset data

Source code in datarec/io/paths.py

def dataset_directory(dataset_name: str, must_exist=False) -> str:
    """
    Given the dataset name returns the dataset directory
    Args:
        dataset_name (str): name of the dataset
        must_exist (bool): flag for forcing to check if the folder exists

    Returns:
        (str): the path of the directory containing the dataset data
    """
    dataset_dir = os.path.join(DATA_DIR, dataset_name)
    if must_exist and not os.path.exists(dataset_dir):
        raise FileNotFoundError(f'Directory at {dataset_dir} not found. Please, check that dataset directory exists')
    return os.path.abspath(dataset_dir)

`dataset_raw_directory(dataset_name)`

Given the dataset name returns the directory containing the raw data of the dataset Args: dataset_name (str): name of the dataset

Returns:

Type	Description
`str`	the path of the directory containing the raw data of the dataset

Source code in datarec/io/paths.py

def dataset_raw_directory(dataset_name: str) -> str:
    """
    Given the dataset name returns the directory containing the raw data of the dataset
    Args:
        dataset_name (str): name of the dataset

    Returns:
        (str): the path of the directory containing the raw data of the dataset
    """
    return os.path.join(dataset_directory(dataset_name), RAW_DATA_FOLDER)

`dataset_processed_directory(dataset_name)`

Given the dataset name returns the directory containing the processed data of the dataset Args: dataset_name (str): name of the dataset

Returns:

Type	Description
`str`	the path of the directory containing the processed data of the dataset

Source code in datarec/io/paths.py

def dataset_processed_directory(dataset_name: str) -> str:
    """
    Given the dataset name returns the directory containing the processed data of the dataset
    Args:
        dataset_name (str): name of the dataset

    Returns:
        (str): the path of the directory containing the processed data of the dataset
    """
    return os.path.join(dataset_directory(dataset_name), PROCESSED_DATA_FOLDER)

`dataset_filepath(dataset_name)`

Given the dataset name returns the path of the dataset data Args: dataset_name (str): name of the dataset

Returns:

Type	Description
`str`	the path of the dataset data

Source code in datarec/io/paths.py

def dataset_filepath(dataset_name: str) -> str:
    """
    Given the dataset name returns the path of the dataset data
    Args:
        dataset_name (str): name of the dataset

    Returns:
        (str): the path of the dataset data
    """
    return os.path.join(dataset_directory(dataset_name), DATASET_NAME)

Framework Interoperability

This section covers the tools used to export DataRec datasets into formats compatible with other popular recommender systems libraries.

`FrameworkExporter`

Exporter for converting RawData datasets to external recommender system frameworks.

Provides methods to format a RawData object according to the expected schema of supported libraries (e.g., Cornac, RecBole).