actions/mark - mark - Gitea: Git with a cup of tea

mirror of https://github.com/kovetskiy/mark.git synced 2026-04-18 12:11:12 +00:00

Author	SHA1	Message	Date
Manuel Rüger	ac264210b5	Feature/robust comment preservation (#768 ) This is based on guoweis-work PR https://github.com/kovetskiy/mark/pull/145 * feat(confluence): add support for fetching page body and inline comments * feat(cmd): add --preserve-comments flag to preserve inline comments * feat(mark): implement context-aware inline comment preservation * test(mark): add tests for context-aware MergeComments logic * fix: remove empty else branch in MergeComments to fix SA9003 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * perf: compile markerRegex once as package-level variable Avoids recompiling the inline comment marker regex on every call to MergeComments, which matters for pages with many comment markers. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: guard against nil comments pointer in MergeComments Prevents a panic when GetInlineComments returns nil (e.g. on pages where the inline comments feature is not enabled). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * test: add edge-case tests for MergeComments; fix overlapping replacement Four new test cases: - SelectionMissing: comment dropped gracefully when text is gone from new body - OverlappingSelections: overlapping comments no longer corrupt the body; the later match (by position) wins and the earlier overlapping one is dropped - NilComments: nil pointer returns new body unchanged - HTMLEntities: <, >, ' selections match correctly Also fixes the overlapping replacement bug: apply back-to-front and skip any replacement whose end exceeds the start of an already-applied one. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: escape ref attribute value in inline comment marker XML Use html.EscapeString on r.ref before interpolating it into the ac:ref attribute to prevent malformed XML if the value ever contains quotes or other special characters. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: use first occurrence when no context is available in MergeComments Without context the old code left distance=0 for every match and updated bestStart on each iteration, so the final result depended on whichever occurrence was visited last (non-deterministic with respect to the search order). Restructure the loop to break immediately on the first match when hasCtx is false, making the behaviour explicit and deterministic. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: log warning when overlapping inline comment marker is dropped Previously the overlap was silently skipped. Now a zerolog Warn message is emitted with the ref, the conflicting byte offsets, and the ref of the already-placed marker, so users can see which comment was lost rather than silently getting incomplete output. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: warn when inline comments are silently dropped in MergeComments Three cases now emit a zerolog Warn instead of silently discarding: 1. Comment location != "inline": logs ref and actual location. 2. Selected text not found in new body: logs ref and selection text. 3. Overlapping replacement (existing): adds selection text to the already-present overlap warning for easier diagnosis. Also adds a selection field to the replacement struct so the overlap warning can report the dropped text. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: update markerRegex to match markers with nested tags Replace ([^<]) with (?s)(.?) so the pattern: - Matches marker content that contains nested inline tags (e.g. <strong>) - Matches across newlines ((?s) / DOTALL mode) The old character class [^<]* stopped at the first < inside the marker body, causing the context-extraction step to miss any comment whose original selection spanned formatted text. Add TestMergeComments_NestedTags to cover this path. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: guard against empty OriginalSelection in MergeComments strings.Index(s, "") always returns 0, so an empty escapedSelection would spin the search loop indefinitely (or panic when currentPos advances past len(newBody)). Skip comments with an empty selection early, emit a Warn log, and add TestMergeComments_EmptySelection to cover the path. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * fix: paginate GetInlineComments to avoid silently truncating results The Confluence child/comment endpoint is paginated. The previous single-request implementation silently dropped any comments beyond the server's default page size. Changes: - Add Links (context, next) to InlineComments struct so the _links field from each page response is decoded. - Rewrite GetInlineComments to loop with limit/start parameters (pageSize=100), accumulating all results, following the same pattern used by GetAttachments and label fetching. - Add TestMergeComments_DuplicateMarkerRef to cover the deduplication guard added in the previous commit. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Fix UTF-8 safety, API compat, log verbosity - levenshteinDistance: convert to []rune before empty-string checks so rune counts (not byte counts) are returned for strings with multi-byte characters - Add contextBefore/contextAfter helpers that use utf8.RuneStart to avoid slicing in the middle of a multi-byte UTF-8 sequence when extracting 100-char context windows from oldBody and newBody - Add truncateSelection helper (50 runes + ellipsis) and apply it in all Warn log messages that include the selected text, preventing large or sensitive page content from appearing in logs - Downgrade non-inline comment log from Warn to Debug with message 'comment ignored during inline marker merge: not an inline comment'; page-level comments are not inline markers and are not 'lost' - Restore original one-argument GetPageByID (expand='ancestors,version') and add GetPageByIDExpanded for the one caller that needs a custom expand value, preserving backward compatibility for API consumers Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Address new PR review comments - Remove custom min() function: shadows the Go 1.21+ built-in min for the entire package; the built-in handles the 3-arg call in levenshteinDistance identically - Validate rune boundaries on strings.Index candidates: skip any match where start or end falls in the middle of a multi-byte UTF-8 rune to prevent corrupt UTF-8 output - Defer preserve-comments API calls until after shouldUpdatePage is determined: avoids unnecessary GetPageByIDExpanded + GetInlineComments round-trips on no-op --changes-only runs - Capitalize Usage string for --preserve-comments flag (util/flags.go) and matching README.md entry to match sentence case of surrounding flags - Run gofmt on util/cli.go to fix struct literal field alignment Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * docs: document --preserve-comments feature in README Add a dedicated 'Preserving Inline Comments' section under Tricks with: - Usage examples (CLI flag and env var) - Step-by-step explanation of the Levenshtein-based relocation algorithm - Limitations (deleted text, overlapping selections, new pages, changes-only interaction) Also add a cross-reference NOTE near the --preserve-comments flag entry in the Usage section. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * docs: fix markdownlint errors in README - Change unordered list markers from dashes to asterisks (MD004) - Remove extra blank line before Issues section (MD012) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Extract named types for InlineComments; optimize Levenshtein search - Introduce InlineCommentProperties, InlineCommentExtensions, and InlineCommentResult named types in confluence/api.go, replacing the anonymous nested struct in InlineComments.Results. Callers and tests can now construct/inspect comment objects without repeating the JSON shape. - Simplify makeComments helper in mark_test.go to use the new named types directly, eliminating the verbose anonymous struct literal. - Add two Levenshtein candidate-search optimisations in MergeComments: * Exact-context fast path: if both the before and after windows match exactly, take that occurrence immediately without computing distance. * Lower-bound pruning: skip the full O(mn) Levenshtein computation for a candidate when the absolute difference in window lengths alone already meets or exceeds the current best distance. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Use stable sort with ref tie-breaker; fix README overlap description - Replace slices.SortFunc with slices.SortStableFunc for the replacements slice, adding ref as a lexicographic tie-breaker when two markers resolve to the same start offset. This makes overlap resolution fully deterministic across runs. - Correct the README limitation note: the earlier overlapping match (lower byte offset) is what gets dropped; the later one (higher byte offset, applied first in the back-to-front pass) is kept. The previous wording said 'the second one is dropped' which was ambiguous and inaccurate. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Fix rune-based lower-bound pruning; clarify test comment - Use utf8.RuneCountInString instead of len() for the Levenshtein lower-bound pruning computation. The levenshteinDistance function operates on rune slices, so byte-length differences can exceed the true rune-length difference for multibyte UTF-8 content, causing valid candidates to be incorrectly skipped. - Update TestMergeComments_SelectionMissing comment to say the comment is 'dropped with a warning' rather than 'silently dropped', matching the actual behavior. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Add missing unit tests for helpers and MergeComments scenarios Helper function tests: - TestTruncateSelection: short/exact/long strings and multibyte runes - TestLevenshteinDistance: empty strings, identical, insertions, deletions, substitutions, 'kitten/sitting', and a multibyte UTF-8 case to exercise rune-based counting - TestContextBefore / TestContextAfter: basic windowing, window larger than string, and a case where the raw byte offset lands mid-rune (é) to verify the rune-boundary correction logic MergeComments scenario tests: - TestMergeComments_MultipleComments: two non-overlapping comments both correctly applied via back-to-front replacement - TestMergeComments_EmptyResults: non-nil InlineComments with zero results returns body unchanged - TestMergeComments_NonInlineLocation: page-level comments (location != 'inline') are skipped; body unchanged - TestMergeComments_NoContext: when a ref has no marker in oldBody the first occurrence of the selection in newBody is used - TestMergeComments_UTF8: multibyte (Japanese) characters in both body and selection are handled correctly Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Fix three correctness issues in MergeComments - Fix html import shadowing: alias the 'html' import as 'stdhtml' to avoid shadowing by the local 'html' variable used throughout ProcessFile. Both callers updated: stdhtml.EscapeString for the ref attribute, htmlEscapeText for the selection search. - Fix selection search with quotes/apostrophes: replace html.EscapeString for the selection with a new htmlEscapeText helper that only escapes &, <, > — not ' or ". Confluence storage HTML often leaves quotes and apostrophes unescaped in text nodes, so fully-escaped selections would fail to match and inline comments would be silently dropped. Add TestMergeComments_SelectionWithQuotes. - Fix duplicate-ref warnings: move seenRefs[ref]=true to immediately after the duplicate-check, before the search loop. Previously seenRefs was only set on a successful match, so multiple results for the same MarkerRef with no match in the new body would each emit a 'dropped' warning. Add TestMergeComments_DuplicateMarkerRefDropped. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Optimize levenshteinDistance to use two rolling rows instead of full matrix Reduces memory allocation from O(m×n) to O(n) by keeping only the previous and current rows. Also swaps r1/r2 so the shorter string is used for column width, minimizing row allocation size. This matters in MergeComments where levenshteinDistance is called for every candidate match of every comment's selection in newBody — on pages with many comments or short/common selections the number of calls can be high. Addresses thread [40] from PR review. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Fix test description and README algorithm doc mark_test.go (thread [43]): - TestMergeComments_HTMLEntities: the description incorrectly claimed ' (apostrophe) was tested; the selection '<world>' contains no apostrophe. Updated comment to accurately describe what is covered (</> entity matching) and note the ' limitation. - Add TestMergeComments_ApostropheSelection: verifies a selection with a literal apostrophe is found when the new body also has a literal apostrophe (the common case from mark's renderer). This exercises the htmlEscapeText path which intentionally does not encode ' or ". README.md (thread [42]): - Step 2 of the algorithm description said context was recorded 'immediately before and after the commented selection' which is ambiguous. Clarified that context windows are taken around the <ac:inline-comment-marker> tag boundaries in the old body (not around the raw selection text), so the context is stable even when the marker wraps additional inline markup such as <strong>. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Unexport mergeComments and cap candidate evaluation Thread [44]: MergeComments was exported but is internal-only — only called within the mark package and tested from the same package. Unexport it to mergeComments to avoid expanding the public API surface unnecessarily. Add a Go doc comment describing the function contract, HTML expectations, and the candidate cap. Thread [45]: The candidate-scoring loop had no upper bound. For short or common selections (e.g. 'a', 'the') on large pages the loop could invoke levenshteinDistance thousands of times, each allocating rune and int slices. Add a maxCandidates=100 constant and break once that many on-rune-boundary occurrences have been evaluated. The exact-context fast-path and lower-bound pruning already skip many candidates before Levenshtein is called, so in practice the cap is only reached for very common selections where the 100th candidate is unlikely to be meaningfully better than an earlier one anyway. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * test: fix HTMLEntities description and add ApostropheEncoded limitation test Thread #43: TestMergeComments_HTMLEntities had a misleading note claiming it covered the ' apostrophe case, but the selection under test ('<world>') did not include an apostrophe. Remove that note and add a dedicated TestMergeComments_ApostropheEncoded test that explicitly documents the known limitation: when a Confluence body stores an apostrophe as the numeric entity ', mergeComments cannot locate the selection (htmlEscapeText does not encode ' to '), so the comment is dropped with a warning. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * Fix CDATA selection fallback and extract contextWindowBytes constant Thread #46: mergeComments only searched for htmlEscapeText(selection) and would fail for selections inside CDATA-backed macro bodies (e.g. ac:code), where < and > are stored as raw characters rather than HTML entities. Restructure the search loop to build a searchForms slice: the escaped form is tried first (covers normal XML text nodes), and the raw unescaped form is appended as a fallback when they differ. A stopSearch flag exits early on an exact context match or when maxCandidates is reached, preserving the same performance guarantees as before. Add TestMergeComments_CDATASelection to cover this path. Thread #47: The context-window size 100 was repeated in four places across mergeComments (two in the context-extraction loop and two in the scoring loop). Extract it to const contextWindowBytes = 100 so it is easy to tune and stays consistent everywhere. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-08 16:20:03 +02:00
Noam Asor	1c1eeb84fb	feat: add GitHub Alerts transformer and renderers Co-Authored-By: Manuel Rüger <manuel@rueg.eu>	2026-03-31 20:54:07 +02:00
Manuel Rüger	ba67fdc54b	docs: update README for v16 and document task lists	2026-03-30 11:23:01 +02:00
Manuel Rüger	2fdcab25cc	chore: bump module path to v16 Update Go module path from github.com/kovetskiy/mark to github.com/kovetskiy/mark/v16 across all packages and imports, following Go module versioning conventions for major versions >= 2. Also update README installation instructions and version string. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-03-20 23:44:59 +01:00
copilot-swe-agent[bot]	61510ff358	Fix go install/get commands to use correct cmd/mark path Co-authored-by: mrueg <489370+mrueg@users.noreply.github.com>	2026-03-20 22:06:34 +01:00
Johan Fagerberg	99dbcd9383	chore: clean up README changes a little bit	2026-03-11 12:49:22 +01:00
Johan Fagerberg	a0e9594f50	chore: clean up over-explanations slightly	2026-03-11 12:49:22 +01:00
Johan Fagerberg	4d887bde74	feat: add support for image dimensions	2026-03-11 12:49:22 +01:00
Johan Fagerberg	c32cd79dc8	feat: add support for '--image-align'	2026-03-11 12:49:22 +01:00
Johan Fagerberg	dcd28068f3	Update README.md Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-03-11 12:47:35 +01:00
Johan Fagerberg	9516939c7d	feat: add support for --content-appearance	2026-03-11 12:47:35 +01:00
Manuel Rüger	e294a7317e	Move mention macro into a goldmark-parser/renderer	2026-02-08 00:59:43 +01:00
Manuel Rüger	b85e40402c	Fix lint	2026-01-19 11:06:06 +01:00
Manuel Rüger	517a3c76a2	Add missed template to README	2026-01-19 11:03:15 +01:00
hypengw	c1c9a13391	parse "linenumbers" in code block	2026-01-15 12:18:28 +01:00
Manuel Rüger	967efde9bd	README.md: Update help output	2025-12-08 21:59:51 +01:00
dgudim	b3a6f1efae	Update readme, add description for insecure-skip-tls-verify	2025-12-08 21:53:28 +01:00
Manuel Rüger	5fd79b897e	ci: Bump markdownlint	2025-11-28 14:18:41 +01:00
Dennis Verheijden	7f7494f26e	Run `markdown-cli2`	2025-09-15 11:07:30 +02:00
Dennis Verheijden	5516809c41	Update the documentation with a section about automatic page titles	2025-09-15 11:07:30 +02:00
Dennis Verheijden	0f13d249f5	Add support for using the filename as the page title	2025-09-15 11:07:30 +02:00
Manuel Rüger	5e2b7b64e8	Drop cloudscript support for mermaid	2025-08-13 14:04:18 +02:00
Manuel Rüger	68f84bedbd	README.md: Drop dead link to blog post	2025-08-13 10:24:28 +02:00
Manuel Rüger	4f1d68bfee	Document admonitions feature	2025-07-11 21:32:24 +02:00
Manuel Rüger	779d1791b4	Version bump to 14.0.2	2025-06-10 13:48:28 +02:00
Manuel Rüger	0618f1de60	Version bump to 14.0.1	2025-06-10 12:28:45 +02:00
Manuel Rüger	bf542ab684	fix: Config loading from file	2025-06-06 13:54:34 +02:00
Manuel Rüger	926945f884	Version bump to 13.0.0	2025-05-31 20:41:06 +02:00
Manuel Rüger	3cc39ffe79	Add support for d2lang	2025-05-30 23:37:15 +02:00
Manuel Rüger	6c33afc866	Bump version to 12.2.0	2025-04-13 00:23:10 +02:00
Manuel Rüger	203d4439ef	Bump to 12.1.2	2025-02-21 16:41:04 +01:00
Manuel Rüger	b30b0491a8	Bump to 12.1.1	2025-02-19 10:57:19 +01:00
Manuel Rüger	b2f0e80b12	Bump version to 12.1.0	2025-02-19 10:55:40 +01:00
Joris Conijn	15a3c10ed1	docs: describe how to use the emoji header	2025-02-17 17:31:04 +01:00
Joris Conijn	ec5ee6eb0a	fix: profile picture	2025-02-17 17:31:04 +01:00
Joris Conijn	ea2bae39da	style: typos	2025-02-17 17:31:04 +01:00
Joris Conijn	1a0e452910	feat: support emojis on pages Define an emoji in the markdown files and get them published as page emoji icons.	2025-02-17 17:31:04 +01:00
tiimo	f0b4d460a9	docs: fix indentation for ac:children macro arguments description	2025-02-13 09:11:08 +01:00
Manuel Rüger	c5d0a8b8b7	Bump version to v12.0.0 Some checks failed continuous-integration / ci-go-lint (push) Failing after 9m31s continuous-integration / ci-markdown-lint (push) Successful in 11s continuous-integration / ci-unit-tests (push) Failing after 7m18s continuous-integration / ci-build (push) Failing after 6m29s continuous-integration / ci-docker-build (push) Failing after 10m15s	2025-01-13 19:09:29 +01:00
Sotirios Mantziaris	f25d8876fc	Log levels support	2025-01-09 20:04:54 +01:00
Manuel Rüger	96db0f8f24	Bump version to 11.3.1	2025-01-09 15:58:54 +01:00
Yurii Myronov	3d96781f47	Support named excerpts - Resolves feature request #316	2024-11-05 12:04:11 +01:00
Manuel Rüger	649c20d4f2	Version bump to 11.3.0	2024-10-22 11:18:05 +02:00
Manuel Rüger	9eb44f95fe	Bump version to 11.2.0	2024-10-09 10:08:13 +02:00
Peter Landoll	b0f337c4a3	feat: add flag to append hash to pages to ensure unique titles	2024-10-09 00:28:45 +02:00
Manuel Rüger	2c71b50438	Bump version to 11.1.0	2024-09-26 09:58:35 +02:00
Noam Asor	035db7b7b3	To add support for github md alerts	2024-09-26 08:21:02 +02:00
Manuel Rüger	0e4d5507b0	Version bump to 11.0.1	2024-09-13 20:04:16 +02:00
Manuel Rüger	b7f17bde8b	Bump to version 11.0.0	2024-09-09 21:56:38 +02:00
Manuel Rüger	1e91fe184f	stdlib: Add multimedia macro	2024-09-09 20:40:56 +02:00

1 2 3 4

170 Commits