Ballo Index excludes C source code

As mentioned in the title, baloo seems to exclude C source code but it indexes CPP file just fine. Here’s some example:

Indexing:tests/sized_delete.cpp (Delete whole directory due to privacy concern).

But for C source file

Indexing:C/Readline_Strok/a.out: Ok
Indexing:C/Readline_Strok/readline: Ok

However, ls that directory gives following output:

a.out  readline  readline.c

Baloo index settings:

[Basic Settings]
Indexing-Enabled=true

[General]
dbVersion=2
exclude filters=*~,*.part,*.o,*.la,*.lo,*.loT,*.moc,moc_*.cpp,qrc_*.cpp,ui_*.h,cmake_install.cmake,CMakeCache.txt,CTestTestfile.cmake,libtool,config.status,confdefs.h,autom4te,conftest,confstat,Makefile.am,*.gcode,.ninja_deps,.ninja_log,build.ninja,*.csproj,*.m4,*.rej,*.gmo,*.pc,*.omf,*.aux,*.tmp,*.po,*.vm*,*.nvram,*.rcore,*.swp,*.swap,lzo,litmain.sh,*.orig,.histfile.*,.xsession-errors*,*.map,*.so,*.a,*.db,*.qrc,*.ini,*.init,*.img,*.vdi,*.vbox*,vbox.log,*.qcow2,*.vmdk,*.vhd,*.vhdx,*.sql,*.sql.gz,*.ytdl,*.class,*.pyc,*.pyo,*.elc,*.qmlc,*.jsc,*.fastq,*.fq,*.gb,*.fasta,*.fna,*.gbff,*.faa,po,CVS,.svn,.git,_darcs,.bzr,.hg,CMakeFiles,CMakeTmp,CMakeTmpQmake,.moc,.obj,.pch,.uic,.npm,.yarn,.yarn-cache,__pycache__,node_modules,node_packages,nbproject,.venv,venv,core-dumps,lost+found
exclude filters version=8
folders[$e]=$HOME/
only basic indexing=false

Looks like C source files aren’t excluded. Any idea why it can’t index C source file?

did you check balooctl6 config list excludeMimetypes ?

also when pasting blocks of code here it’s helpful for the reader if you use

3-backticks 
to 
encase your code block

1 Like

Excute balooctl6 config list excludeMimetypes, Here is the output

text/csx
text/x-python
text/x-copying
text/x-java
application/ld+json
application/x-awk
application/geo+json
text/x-sed
text/x-pascal
text/x-cmake
application/x-cgi
application/x-javascript
text/x-c++src
application/xml
text/x-yacc
application/json-patch+json
text/x-ruby
application/x-ipynb+json
application/x-java
text/x-lua
text/x-chdr
text/x-assembly
text/x-csrc
text/x-objsrc
text/x-haskell
text/x-readme
application/json
application/x-python
application/pgp-encrypted
text/vnd.trolltech.linguist
text/asp
application/x-php
text/x-c++hdr
text/x-erlang
application/x-perl
application/x-sh
application/x-csh
text/x-scheme
text/x-qml
application/javascript
text/x-fortran
text/css
text/jsx

Use file command to get file mimnetype, Here’s the result:

C/Rand/main.c: text/x-c; charset=us-ascii

Looks like this file type isn’t excluded.

tho… try removing that from the list.

and maybe text/x-csrc as well.

1 Like

Remove them, but it still didn’t work. and balooshow doesn’t give me the cached information. Here are some examples:

17207bc5d579c30 1566022704 24250300 music_tx_hackrf/untitled.grc
        Mtime: 1701159141 2023-11-28T16:12:21
        Ctime: 1701159141 2023-11-28T16:12:21
        Cached properties:
                Line Count: 90
17276735d579c30 1566022704 24278643 Music/START:DASH!!.mp3
        Mtime: 1599544335 2020-09-08T13:52:15
        Ctime: 1720859987 2024-07-13T16:39:47
        Cached properties:
                Bitrate: 192000
                Channels: 2
                Duration: 257
                Sample Rate: 44100
                Release Year: 20200917
                Comment: https://www.youtube.com/watch?v=-YpYkwM2ay4
                Artist: μ's
                Album: μ's Best Album Best Live! Collection Ⅱ
                Title: START:DASH!!

As you can see, when baloo successfully indexed a file, balooshow command will present cached properties. But for C source code, here’s the output.

174462d5d579c30 1566022704 24397357 Rand/main.c
        Mtime: 1732005994 2024-11-19T16:46:34
        Ctime: 1732005994 2024-11-19T16:46:34

No line count. The only way to get line counts of C source file in baloo is to clear a specific file index data and re-index it. And if I re-index it, it can be searched correctly in Dolphin. Still thanks for your help. And now I believe it’s a bug in baloo.

In case anyone need to index C source file, here’s a link: 497360 – Baloo Unable to Index contents in C Source File. Please note that mimetypes defined in KDE is different from what file command gives us. For C source, file gives us mimetype text/x-c, but in KDE, it’s actually text/x-csrc. In case anyone need to confirm what a mimetype of a file is defined in KDE, he needs to use command kmimetypefinder.

1 Like