Import Json API

Hello, I’m interested in creating a JSON API for the import. For this I implemented a custom ImportSession that records every task in choose_match while skipping it. This is used to build a list of found matches and the decisions required from the user.
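For anyone following along, the recording session described above might look roughly like this. This is a hedged sketch: the `ImportSession` and `action` definitions here are minimal stand-ins, and a real implementation would subclass `beets.importer.ImportSession` instead.

```python
# Hedged sketch: record each task's match candidates instead of resolving
# them. The classes below are stand-ins for beets' own; real code would
# subclass beets.importer.ImportSession and return importer.action.SKIP.
import enum


class action(enum.Enum):
    """Stand-in for beets.importer.action."""
    SKIP = 1


class ImportSession:
    """Stand-in for beets.importer.ImportSession."""
    def choose_match(self, task):
        raise NotImplementedError


class RecordingSession(ImportSession):
    def __init__(self):
        self.pending = []  # tasks awaiting a user decision

    def choose_match(self, task):
        # Remember the task (and its candidates) so the API can expose
        # them, then skip it so nothing is imported yet.
        self.pending.append(task)
        return action.SKIP
```

The key design point is that choose_match is the hook where beets normally blocks for user input, so recording-and-skipping there lets the rest of the pipeline drain without side effects.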

But now I have the problem of importing one of the matches, with the user’s decision, after the session has finished. Is there already a good solution for this, or how could this be solved without changing too much of the current way imports work?

Wow; that’s quite ambitious! Is your eventual goal here to build a new user interface on top of that JSON API? Sounds cool.

The problem you pointed out—i.e., “reviving” an import task asynchronously, perhaps after beets itself has terminated—is indeed the hard part. There will be no easy way around this. There might, however, be several hard ways to do it. You might find these issue threads enlightening:

I think the thing you’ll need to do is create a way to serialize an ImportTask and reconstitute it later as part of a new session. Does that make sense?
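Serialization could start as simply as capturing the fields needed to rebuild the task later. A hedged sketch with plain dicts and JSON follows; the field names are illustrative, not beets’ actual ImportTask attribute names:

```python
# Hedged sketch: reduce a task to a JSON blob and back. A real version
# would pull these values off an ImportTask and, on the way back,
# reconstruct the task and re-fetch candidate metadata from the IDs.
import json


def serialize_task(task):
    """Dump the fields needed to revive the task later (illustrative names)."""
    return json.dumps({
        "toppath": task["toppath"],              # root of the import run
        "paths": task["paths"],                  # item paths in this task
        "candidate_ids": task["candidate_ids"],  # e.g. MusicBrainz release IDs
    })


def deserialize_task(blob):
    """Rebuild the dict; reconstructing a real ImportTask is the hard part."""
    return json.loads(blob)
```

Storing only candidate IDs (rather than full match objects) keeps the blob small, at the cost of a metadata re-fetch when the task is revived.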

Thanks for the info! A web interface is not my direct goal, as I’m not really good at frontend development xD
My first goal is just to implement the API, so it’s possible to run imports remotely without SSH.

For an easy beginning I think I will just save the ImportTasks in memory, and solve persistence once the import is working. As for the import, I plan to override the run method to only read tasks and look up the candidates, and then implement a new pipeline stage that saves the tasks.
Then I will write a function to update a task according to user input from an endpoint.
What I still need to find is the point in the normal pipeline from which I should continue in order to apply the choice for a task and import it into the database, especially when it comes to plugins.
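The task-saving stage could be sketched as a generator in the style of beets’ pipeline utilities, where each stage receives tasks via `yield` and passes them downstream. The driver below is a toy stand-in for the real pipeline machinery, and the in-memory list stands in for whatever persistence ends up being used:

```python
# Hedged sketch of a generator-based pipeline stage. beets' util.pipeline
# wires stages like this together; here a tiny driver stands in for it.
store = []  # in-memory stand-in for real persistence


def save_tasks_stage():
    """Stash every task so it can be resumed later, then pass it through."""
    task = yield            # prime the coroutine
    while True:
        store.append(task)  # persist the task for later resumption
        task = yield task   # hand it to the next stage


def run_pipeline(stage, tasks):
    """Toy driver: feed tasks through a single stage."""
    coro = stage()
    next(coro)  # advance to the first yield
    return [coro.send(t) for t in tasks]
```

Because the stage passes tasks through unchanged, it can be dropped anywhere in the chain without affecting the stages after it.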


Got it. It seems like you’ll want to set up an import pipeline that only has the last few stages, after a decision is made. If you look at, you can see the standard order in which stages are run:

For your purposes, I think you’ll want to include everything after user_query—crucially, the plugin stages and manipulate_files. But before injecting things back into the pipeline, you’ll probably want to call apply_choice to save things back to the database:

In general, the design is such that pipeline stages never really have “in-flight” changes to Album and Item objects that are not saved to the database. Each stage is responsible for calling store to ensure that updates are persisted.
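Putting that together, a resumed import for one task could run something like the following. The stage names mirror those mentioned above (apply_choice, the plugin stages, manipulate_files), but the bodies here are illustrative stubs, not beets’ real implementations:

```python
# Hedged sketch: run only the post-decision tail of the pipeline.
# Each stub logs its call so the ordering is visible; the real stages
# would persist changes (via store()) and move files.
log = []


def apply_choice(task):
    # In real beets, this writes the chosen match back to the database.
    log.append(("apply_choice", task))
    return task


def plugin_stage(task):
    # Placeholder for the plugin-provided stages that run after user_query.
    log.append(("plugin_stage", task))
    return task


def manipulate_files(task):
    # Placeholder for the stage that moves/copies files into the library.
    log.append(("manipulate_files", task))
    return task


def resume_import(task):
    """Feed one revived task through the remaining stages, in order."""
    for stage in (apply_choice, plugin_stage, manipulate_files):
        task = stage(task)
    return task
```

Running apply_choice before the plugin stages matches the advice above: the database already reflects the decision by the time plugins see the task.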

I was thinking the other day that storing import jobs as their own objects in the DB might actually be useful. The imports table would store import job records, keeping track of original directory, command line options passed in, the match choice, status, etc.

Typical use cases would include being able to run an initial import of a large collection of music while setting quiet and timid, and come back and query for any import jobs where the autotagger wants your input. Or alternatively, you could query for import jobs that are completed, but where the original imports originated in a particular directory.

Each album and item would also record which import job it originated with. You could query the library to determine which import job a given file was originally associated with.
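As a rough illustration of that idea, the imports table could be sketched in SQLite like so. The column set is speculative, just mirroring the fields mentioned above:

```python
# Hedged sketch: a speculative schema for import job records, plus the
# "autotagger wants your input" query described above.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE imports (
        id INTEGER PRIMARY KEY,
        directory TEXT,   -- original import directory
        options TEXT,     -- command-line options passed in (e.g. as JSON)
        choice TEXT,      -- the autotagger match choice, once made
        status TEXT       -- e.g. 'needs_input', 'completed'
    );
""")

# A quiet/timid run leaves a job waiting for input...
conn.execute(
    "INSERT INTO imports (directory, options, status) VALUES (?, ?, ?)",
    ("/music/incoming", '{"quiet": true, "timid": true}', "needs_input"),
)

# ...which the user can come back and query for later.
rows = conn.execute(
    "SELECT directory FROM imports WHERE status = ?", ("needs_input",)
).fetchall()
```

Linking items back to jobs would then be an `import_id` column on the items and albums tables, enabling the “which job did this file come from” query.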

Perhaps this would be best as a separate thread, but this would certainly lend itself towards a web dashboard for imports in the future.

I actually started building the web interface for the import. It is in a very rudimentary state, but if you want to take a look you can find it in my fork (
Here is a list of things that I think should be done before a release:

  • Implement search by artist and album as well as by ID
  • Choosing different candidates
  • Display differences/changes like it’s done in the console
  • Handle existing Albums/Tracks (merge, delete etc.)
  • Saving ImportTasks in the database