It would be nice to make plugins in kdenlive

ultraderek · January 27, 2024, 7:54pm

It would be nice to be able to make and share plugins. Lets say we want to add some sort of automation, effect maker or add an AI model integrated into the system.

Nathan · February 1, 2024, 5:44pm

There are effect templates, but I don’t know about anything more advanced.

Eugen_Mohr · February 2, 2024, 8:40am

Add-on integration is something Kdenlive team is thinking about. But the integration is not easy at the moment. Other aspects are security reasons which we have to check.

YousufPro67 · September 16, 2024, 5:38pm

@Eugen_Mohr Sorry for late reply; I love kdenlive but the only problem I’m facing is the fact that we can’t make custom effects but only presets. I use templates but sometimes I have to resize them etc. I have to use lightworks (as it is available on Linux too and has custom effects not presets).

einar · December 30, 2024, 12:25pm

I just reviewed Kdenlive Roadmap - Kdenlive

Under “Long term” I find

Scripting support (eg. Python API)

I reaaaaally think this should have a higher priority, as it would enable the community to contribute in a completely different way and thus make kdenlive more competative. Especially in this time of AI-tooling not having api/scripting/rpc/plugin is a bottleneck. I can imagine all kinds of tools popping up around kdenlive if there would only be a way. And it may offer an alternative resolution to many of the other issues listed in the roadmap (some may be solved as community plugins, at least as an intermediary or prototype solution).

I understand that it is a question of resources but I just want to present the viewpoint that scarce resources is just another argument why this should be prioritized, over adding other new features in the core software. Ideally, kdenlive would be as extensible and customizable as possible. This would be a competative advantage to other closed products.

Just look at neovim!
(PS. Lua may be a good alternative to Python)

Ron · December 30, 2024, 2:49pm

But would it? Would it reaaaaally?? What exactly are you wanting to contribute which could only be done this way?

A bottleneck for what? There’s a lot of buzzwords squeezed in there, but how could we possibly design a useful api/scripting/rpc/plugin without even a single concrete example of what you want to use it for?

There are ways, and they have. What kind of tools are you imagining that there currently is no way for?

Which ones?

Then you’re way ahead of me on this and you’re going to have to bring the rest of us up to speed…

If I was to drop everything on my current todo list and make “this” my highest priority - I still have absolutely no idea what you reaaaaally want to do with it, whether any of what you’re imagining is actually possible to do in kdenlive (as opposed to in some other part of the toolchain), or what you imagine would make things easier or more accessible than the ways in which they are currently already possible for people to do them…

Why do you say “ideally”? It’s open source software with a CC-SA licence - you don’t really get more “as extensible and customizable as possible” than that.

It is.

For what? Which image processing tasks do you think an interpreted script language might be a good choice for?

micha · December 30, 2024, 5:24pm

Kdenlive will never be able to compete with the big, absolutely professional editors, and it doesn’t have to.
I think the creators and the community need to ask themselves where they want to go.

The most important thing for me would be that Kdenlive is stable. What I need is reliability. Since it is an opensource software, I don’t want to make any demands, but rather be grateful for what it can and offers.

Reliability is 100 times better to me than lots of new features that are great but don’t work 100%. There is no alternative, especially for the many Linux users. And that’s why quality should always come before quantity.

The top priority should be that you can make and render your project at any time, without the nasty surprises because something new is nice but doesn’t work.

Ron · December 30, 2024, 6:13pm

I get your sentiment here - but I wouldn’t go so far as to say that … there’s no fundamental reason it can’t, and for many people it already does.

I agree we don’t have to treat it as a competition though, it can become best of breed and implement good ideas on merit, not just because Someone Else Does Them.

And I’m not at all saying we shouldn’t be talking about ways to improve it. All I’m saying is that “It needs more ponies, ponies are kewl!” isn’t really a very actionable request.

If someone has a real use case for a thing they’d really benefit from being able to script. Or a real idea for some class of plugins, that isn’t already possible via the several plugin mechanisms already available. Or something like that. Then we can have a discussion about what options there are, what might need to be done, what the pros and cons are, what a design might look like, what functionality it must provide.

If nobody can think of a use case that isn’t entirely handwavy, entirely contrived, entirely unrealistic, or entirely a cloud of If We Build It They Will Come pixie dust - then we’re still a few steps short of knowing what problem this might be the Perfect Solution to.

Picking a language at random most certainly isn’t the first thing that needs to be done.

42

berndmj · December 30, 2024, 6:30pm

LOL. Wish it was that easy

Ron · December 31, 2024, 3:22am

FWIW, what lightworks does, does seem to be a nice idea. In addition to letting you save effect stacks like kdenlive already does, it lets you define your own pixel shaders that run on the GPU.

Which isn’t quite the same thing as “custom effects”, but it is a good and logical place to “plug things in” with a well defined interface.

This is how movit works - and I don’t really know what the ongoing issue between it and mlt is, but if that is solved, this might actually be a good thing to have on the roadmap.

YousufPro67 · January 4, 2025, 8:19am

Can anyone tell me what short, mid and long term means in the kdenlive roadmap?

berndmj · January 4, 2025, 4:31pm

Please note that there is one main developer with some code contributors few and far between. So, depending on the complexity of the task it can take longer than anticipated. Also, keep in mind that the roadmap may change, and something that was planned mid or long-term may be moved up, and vice versa. But generally speaking, short term means it’s planned for the next 2-3 releases, mid-term 3-4, and long-term 4-5. Releases are .4, .8, and .12 in the yearly release cycle, i.e. 25.04.0 is the next release and also the first one in 2025. There are bug fixes that are released in between releases and are simply a sequential number in the last digit of the version (e.g. 24.08.3).

DontBother_YT · January 25, 2025, 7:48pm

One plugin idea that I would love to see is an Audio-Reactive image plugin. Nice and simple concept: Taking an audio clip (not the Master) and making a separate image react to it. Similar to what Discord does in voice chats where it lights up who is speaking.

Problem: It is impossible to know which combinations of preset effects will allow me to have that completed composite.

With a scripting plugin system, I could actually build that reactive Image effect without having to hunt, peck, and pre-process audio just to get it to show up in my project monitor.

I know there’s much more to adding that type of system than what I am aware of. So security, reliability and many other factors would require awareness before publishing that type of add-on to an already-stable program.

sherkhan30452 · January 29, 2025, 2:43pm

I mentioned this already, (regarding lua) last year.

https://discuss.kde.org/t/feature-request-use-lua-scripts-as-plugins-to-automate-small-tasks/14019

Also here too
https://discuss.kde.org/t/clip-bin-job-create-job-to-change-duration-of-image-clip-s/12424/3?u=sherkhan30452

And I understand the situation, so I am not asking again.

sherkhan30452 · January 29, 2025, 3:55pm

I tried to do automation, but using 3rd party software.

Kdenlive Clip Manager | Ft. FFmpeg & ImageMagick

Automating Kdenlive | Ft. Actiona

einar · April 10, 2025, 1:43pm

I do understand that my question could come across as "oh just give me more shiny things". And as I said, I understand that in this kind of project time/energy is scarce and have to prioritize wisely.

I wasn’t however prepared for the level of push back.

Of course it would have helped if I’d presented some concrete API/plugin ideas. Of course. But I didn’t do it as I thought “hey, shouldn’t it be fairly obvious that an API would be used in many ways probably not anticipated right here and now?”.

And also there is this AI-thing going on in the world right now.

I have a lot of issues with the AI-hype and its impact on society, democracy, sustainability etc. But even more so I want not-for-profit driven organisations such as KDE and projects like Kdenlive to succeed. And without any doubt, AI-integrations will be the norm eventually and it is already starting.

There is now a new open standard that has emerged and is being embraced that addresses the question of how to integrate LLMs with software and services. It is called Model Context Protocol, you may have heard of it if you are a developer. It relies on a server <-> client relationship where the server is the thing which has capabilites (resources, tools, prompts) and the client is perhaps your terminal AI assistant, Claude Desktop or whatever you use to interact with your LLM.

I use this on a daily basis as a developer, not like a naive “vibe coder” doing things to show off on youtube, but rather as a practical helper assistant and extended brain for suitable tasks which are within reach.

MCP-servers for software are now starting to pop up here and there. This includes 3D-modelling program Blender (by the non-profit Blender foundation) and commercial competitors in the video editing space such as DaVinci Resolve.

I have no idea how capable these early implementations are at the time of this writing, but they do define a new world where all kinds of software will start to share a common way for enable interaction though human natural language. This can be written - or perhaps more commonly down the road - spoken.

I push this issue not from a point of an immediate need, but rather as an early signal for what is coming and with a sense of urgency in terms of starting to explore that path. You say that the current infrastructure is an obstacle here, making a public API/scripting harder. Well - isn’t this just another reason to start looking at this sooner rather than later? The process itself doesn’t have to be rushed, but the direction is important I think.

Why / how this matters to me personally? Well, I’m a jack of many trades. Which means I also don’t have the luxury to only work with one thing, eg. video-editing. I jump between disciplines in my daily work. At the same time I’m an engineer and I like working with proper tools whenever possible. And - from an ideological standpoint - I want to use FOSS and / or software from non-profits (even if it is paid). Therefore I’m here, in this forum.

But I will never be fluent / expert in Kdenlive. So therefore a human language interface would be useful / helpful (i can also imagine other automation helpers for my kdenlive work, eg. importing and using assets a lot more easily). And I don’t think the proper way to do it is to develop it yourself, but rather just expose the functionality and let users leverage whatever AI-toolchain they have available - today or tomorrow. The alternative is that Kdenlive developer team itself takes on the task to provide all these features and I don’t think that is neither the best outcome/UX nor resource effective. Use standards.

This isn’t about today, it is about tomorrow and where things are heading.

Ron · April 11, 2025, 2:45am

If the collision between reality and dreams is “push back”, then ok … but that’s not what I would have called it. I’m sorry if I came across as a bit brutal - but it seemed appropriate to be very specific about how you might make more practical suggestions if the overarching theme of the problem was that you were being inactionably vague and/or impractically romantic about what you actually wanted to do.

And what if one of the “unanticipated ways” is just people won’t use it at all, and it won’t actually solve any of their Real World problems?

Because there’s actually evidence for that outcome. There’s been a plugin API for effects for a very long time. And a completely open ‘API’ that can Do Anything. People saying “I can’t do X with that interface, even though I’ve tried Y and Z” have an actionable problem.

“I can’t even imagine how this might be used yet, but it needs plugins” is mostly just a big red flag on the road to a very poor design. I want world peace is a lovely dream, but neither that nor “you just need to be nice to each other” is an actual plan for how to achieve it.

That’s not a standard, that’s a github project. And a volatile and experimental one at that.

No actual problem, and no actual experience with, or research into, the proposed solution isn’t really the usual foundation for a successful and useful API. And I’m not having a dig at you there. It’s just that I’ve designed and deployed and been on point for maintaining enough of them to know that it is always a mistake to try to implement something speculatively, that you personally have no use for and that nobody else has ever detailed a specific use case for.

If you ignore that and do it anyway, you will just create something that works well for nobody at all. And if or when the first real user actually makes themselves known, their real world requirements will need it to work differently, except now you’ll have a legacy mess getting in the way of doing that well, which if you change then someone else will come out of the woodwork to say “hey I was using that in a dirty hack of my own”… cf. xkcd: Workflow

That’s a dark alley you really don’t want to go down without a torch.

what functionality, and how?

It’s all already exposed in C++, and some is exposed though a variety of other interfaces. There is no conspiracy to hide the functionality which we just need to remove.

If none of those work for something you actually want to do, then it’s useful to talk about what you want to do and why they don’t do it. It’s not about whether it’s in the sooner or later to look at column in the road map, so much as there simply being nothing to look at if there is no concrete use case. The first step in being able to use human language to ‘solve’ a problem is still going to be finding the human language to actually define that class of problems to the other humans who you are hoping will implement it for you in a way that it’s actually possible to implement for that particular use.

And really, if your use is “I want an effect that flashes the screen red”, the best answer is almost certain not going to be “we should implement a general purpose NLP oracle!”. A large part of why people coming from other software to kdenlive find it so nice to use once they get over the small muscle memory differences is precisely because the implementation is being driven by real users and real use cases, and not by speculation on what Killer Feature might be The One to make it the leading marker leader in market leading next year with big bonuses for all the executive staff.

It’s not push back to say “when you have a real problem today, it will help everyone much more to tell us about that”.

mao · May 10, 2025, 7:32pm

I’d love to eventually have a way to develop plugins / add-ons / effects(?) for Kdenlive to suit my workflow / make it simplet to implement ideas etc…

Exemple: I’m editing a video with multiple people speaking, and I have a few ‘personas’ for each speaker, with different expressions, etc… And I’d want to create a tool to simplify the ‘manual labor’ of doing things completely by hand (import each image, place them accordingly, etc). I could just drop the ‘frames’ I need, and switch them with keyframes.

It absolutely CAN be done without a custom system, it’s just more tedious.

Not asking anything here, juste stating a potential usecase if anyone passes by.

I’m grateful that such a wonderful tool exists, and it has already pretty much everything I’ll ever need in an editing software, thanks for the hard work of everyone involved !

Paul_Brown · May 10, 2025, 7:34pm

Disclaimer: not a dev, but… There are already at least two plugins that I know of—the caption writer and the background separator thingy. If I am not wrong, both written in Python. So I guess there is a framework.

Ron · May 11, 2025, 4:23am

I’m not quite sure what you mean by “the caption writer” - but if the “background separator thingy” is the support for SAM2 that’s been recently added, then that’s not a “plugin”.
It is an external tool that’s been integrated to use with kdenlive, but there’s code written to explicitly and specifically support that. It’s not been bolted on via some generic API.

The gap between wish and reality is that all of these different wishes would need different interfaces exposed. The access to all of them is via the C++ code, which anyone is already free to modify to develop “add-ons / effects / automation” etc.

A ‘plugin’ interface for pixel shaders and effects isn’t going to enable you to automate timeline tasks, or vice versa.

And there already is at least two plugin interfaces for effects that have been available for a very long time, frei0r and MLT.

We all want “better automation” and less “manual labour”, and anything that helps to build that - but “just add plugins” isn’t a magic pill that can magically make everything easy.

The “I’d love to develop effects” argument would be a lot more convincing if people were actually using, or trying to use, the existing interfaces for that, and then explaining why the existing interfaces were insufficient and needed to be improved. That would be actionable. But if nobody is using what we already have, why would we expect they might use Yet Another Interface for that. Or what basis is there for claiming that it fixes some shortcoming of what already exists?

I’m not trying to be the grinch here - just trying to close the gap a bit between what some users appear to be imagining, and what it’s actually possible to do.

We can’t design an interface to “place images accordingly” without a very precise definition of what “accordingly” actually means in that context. And we can’t design a generic interface for that unless everyone who might want to use it can agree on an accordingly that works for all of them.

Plugins work when you have “a very specific and well defined job” to be done, and multiple, optional, ways to do it which not every user might want to be burdened with installing the dependencies for, or not every user on every platform is able to use.

They can’t do anything that the interface for them wasn’t explicitly designed to allow, and they probably aren’t even the easiest way to do many of the things that people are imagining they might somehow conjure up.

If people really want easier ways to do specific things, then describe what you’re doing in detail, not in abstract terms, and then describe what you think might make doing that easier/more efficient/more accurate. That’s a valuable feature request. If you’re able to, submit some code that either does it, or sketches it out as a proof of concept. But if you’re not able to, then you really are better off sticking to describing your actual problem and letting the people who might code it for you use their best judgement on how to implement it.